Gene Francci3_3430 details

Gene Information

Locus tag: Francci3_3430
Symbol:
ID: 3905670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4078669
End bp: 4079997
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 72%
IMG OID: 637880753
Product: transposase
Protein accession: YP_482513
Protein GI: 86742113
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
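
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive (an assumption, though it agrees with the reported gene length) and that the final codon of the CDS is a stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4078669
end_bp = 4079997
protein_length_aa = 442

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1329
print(remainder)                # 0
print(expected_protein_length)  # 442
```

Both derived values match the Gene Length and Protein Length fields of the record.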


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACGC TCCAGGCGTA CCGCTTCGCC CTCGACCCGA ACCAGGCCCA GCTCGCCGGT 
ATCCGCCGCC ACGCCGGGGC GTCGAGGTTC GCCTACAACT GGGGCCTGGC CCGGGTGAAG
GCCGCGCACG CCCAGCGTGA CGCCGAGCAG TCCTACGGGC TGACGGGCGA CCTGCTCACC
CCCGTCCCGT GGACGCTCCC CGCGCTGCGC CTCGCGTGGA ACGCGGTCAA GCGGGACATC
GCCCCATGGT GGGACGAGTG CTCGAAGGAG GCGTTCCGCG CCGGGCTCGA CCAGCTTGCC
CGCGGGTTGA AGAACTTCAC CGACTCCCGG CAAGGGAAGC GGAAGGGCCG GCGGGTCGGC
TTTCCCAGGT TCAAGAAGCG GGGAAAGGCC CGTGACTCGT TCCGGTACAC CACCGGCGCC
TACGGCCCGG CTGACGAGAC CTACGTGAAG CTGCCCCGGA TCGGGCGGGT GAAGGTCCAC
GAGCCGATGG GCGCCCTGAC CGGCCGGCTG GCCGACGGAC GTGCCCGACT GTTCGGCGTG
ACCGTGTCCC GGACTGCTGA CCGCTGGTTC GTGTCGTTCA CCGTCGAGGT CGACCGCGAC
GTTCCCGAGC GGCCGTCGCG ACGGCAACGC GCGGGCGGCC CGGTGGGCGT CGACCTCGGC
GTGAAGCACC TCGCTGTTCT GTCGACCGGG CAGACCGTTC CCAACCCGAA GCACTACCAG
CGGGCCGAAC GGCGACTGCG CCGGGCGTCG CGGGCCCACG CCCGGTCGAA GCCGGGCAGC
GCTGGGCGGC GGCAGCGCGC CGCCCAGCTC GCGACGATCC ATGTCCGGGT CGCGAACCAG
CGCCACAACG GGCTGCACAA GCTCACGACC CGGCTCGCCC GGTCCCACGA CACGGTCGTC
GTCGAAGATC TGCACGTCGC CGGGATGGTC CGTAACCGGC GGCTCGCCCG CGCGGTCGCC
GACGCCGGCA TGGCCGAGGT CCGCCGGCAG CTTGCCTACA AGACCCGCTG GTACGGATCG
ACGCTCGTCG TCGCCGACCG CTGGTATCCC AGCTCGAAGA CCTGCTCCGG CTGCGGCTGG
CGAAACCCAA GCCTGACGCT GTCCGAGCGC ACCTTCACCT GCCAGTCCTG CGGGCTGGTA
CTCGACCGCG ACCACAACGC CGCGATCAAC CTGCACCACC TCGTCGCCGC CAGTACACCG
GAGACGGAAA ACGCCCGTGG AGCCGACCGT AAGACCCGCG CAAGCGGGCG GGTGGCTGGG
AAGCGGGAAC CCGGCACGGC CACGGCCGGT CAGACCGGGG GTGCCTCGCC GAGAGGCGAG
GCGGCATGA
 
Protein sequence
MTTLQAYRFA LDPNQAQLAG IRRHAGASRF AYNWGLARVK AAHAQRDAEQ SYGLTGDLLT 
PVPWTLPALR LAWNAVKRDI APWWDECSKE AFRAGLDQLA RGLKNFTDSR QGKRKGRRVG
FPRFKKRGKA RDSFRYTTGA YGPADETYVK LPRIGRVKVH EPMGALTGRL ADGRARLFGV
TVSRTADRWF VSFTVEVDRD VPERPSRRQR AGGPVGVDLG VKHLAVLSTG QTVPNPKHYQ
RAERRLRRAS RAHARSKPGS AGRRQRAAQL ATIHVRVANQ RHNGLHKLTT RLARSHDTVV
VEDLHVAGMV RNRRLARAVA DAGMAEVRRQ LAYKTRWYGS TLVVADRWYP SSKTCSGCGW
RNPSLTLSER TFTCQSCGLV LDRDHNAAIN LHHLVAASTP ETENARGADR KTRASGRVAG
KREPGTATAG QTGGASPRGE AA
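
The relationship between the two sequences can be illustrated with a minimal translation sketch. It is not the tool that produced this record. It builds the codon table shared by the standard code and translation table 11, and it assumes that the first codon of a CDS is read as methionine even when it is an alternative start such as GTG, as at the beginning of this gene. The function name `translate_cds` and its `has_start` parameter are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; the amino acid assignments are the same in
# the standard code and in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna, has_start=True):
    """Translate an in-frame DNA fragment (hypothetical helper).

    Whitespace is ignored, so the 10-base groups of the record can be
    pasted directly. When has_start is True, the first codon is assumed
    to be the annotated start and is emitted as M.
    """
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if has_start and protein:
        protein[0] = "M"
    # Drop a terminal stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate_cds("GTGACGACGC TCCAGGCGTA CCGCTTCGCC"))  # MTTLQAYRFA
print(translate_cds("GCGGCATGA", has_start=False))        # AA
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above.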