Gene Francci3_1864 details

Gene Information

Locus tag: Francci3_1864
Symbol:
ID: 3906139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2198759
End bp: 2200399
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 71%
IMG OID: 637879202
Product: transposase IS66
Protein accession: YP_480969
Protein GI: 86740569
COG category: [L] Replication, recombination and repair
COG ID: [COG3436] Transposase and inactivated derivatives
TIGRFAM ID:
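
The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2198759, 2200399)
print(length, protein_length_aa(length))  # 1641 546
```

Both values agree with the Gene Length and Protein Length fields of this record.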


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.164551
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTGTTC TGTCTGTCAC CGATGATGTC ACCGAGGTGG CGTACTGGCG TGGGCGTGCC 
GAGCGGGCCG AGGAGTGTGC GGAGAAAGCC GAGGCCCGTG TCGGGCAGCT GCAGCTGCGG
GTCGAGGAGT TGAGCGAGCA GGTCGCGGTG CTGTCCCGGA TGCTGTTCGG TCGTTCCTCG
GAGAAGACCG GCCCGTCGTC GGCTGTGGAT GAGAAACCAG AAGATCGGCA GGATTCGGGC
GGTGGGGATG CCGGCCGGCC GGCGCGTCAA CGCGGGCAGC GGCCGGGGAG CCGGGGGCAT
GGCCGGCGGG ACTACTCGCA TCTGCAGACC CGCGAGGAGA TCCATGATGT GCCCGAGGTC
GACCGTGCCT GCCCCGGGTG TGGGGTGGCG TTCACGCCGT TGGGGACCGA CGACAGCGAA
CAGGTCGACT GGCAGGTCGT GATCACCCGG ATCGTGCATC GGCGGCGGCG GTATCGGCGG
TGCTGCACAT GTCCGGGGCC GCGGACAGTG ACCGCGCCGG TGCCACCCAA ACCGATTCCC
AAGGGCCGGT TCACCGCAGG GTTCCTCGCC CGCCTTCTCT ACGAGAAGTA CGTCCTGGGC
CTGCCGTTGC ACCGGATCGC TCGGGCGCTG GCCGCCGACG GGCTCGGTGT TGCCGAGGGC
ACTCTGTGTG GGGCGTTGAA GGACGTGCAT GGACTGCTCG GCGGGCTCGA TGAGCAGATC
GTGGCGCGTA ACGCCGCCGC CGGTCATGTC CACGCGGACG AGACGACGTG GCGGGTGTTC
GAGCGGGTCG AGGGCAAGGA CGGGACCCGC TGGTGGCTGT GGGTGTTCGT CGCCGCCGAC
ACGGTGGTGT TCCGGATGGA CCCGACCCGC TCGGCTGCCC CGGTCGAGAA GCACTTCGGG
ATCGACCGGG CCGCCGGGGC GCTGTCCGAC GGACGTCGCC TCGTCGTCTC GTCGGACTTC
TACACCGTCT ACCAGTCCCT GGGCCGCGTC GACGGAGTCG ACCCGCTCTG GTGCTGGGCA
CACATCCGCC GGTACTTCAT CCGGGCCGGG GACGCCCACC CCCAACTGCG GTACTGGGCC
GACCAGTGGG TCGCCCGGAT CGGGATGCTC TACCTCGCTC ACCGCGCCCT CGCCGCCGAG
CAGCCCACAA CCGGCGGCTA CCGCGAGGCC GCCGGCGCGT TCGAGGCCGC GCTGAGGGCG
ATCGACACGG CGCGGCGCGC GGAGGCGGCG ATCCACAGCC TGCACCCGGC GGCGAAGAAG
GTCCTGGCGA CCCTGGACCG GGAATGGGAC GGGCTGGCCC GCCACCAGGA CTTCCCCGAC
CTGGATCTTG ACAACAATGC TGCCGAGAGA GCGCTACGGA CCCCGGTCGT CGGGCGGAAG
AACTACTACG GCGCACACGC TGAGTGGGCC GCGCACCTCG CCGCCCGGGT CTGGACCATC
GTCGCCACCG CGGAGCGTAA CGGCCGTGAA CCCCTCGCGT TCCTGACCGG CTACCTGAAC
GCCTGCGCCA CAGCCGGCGG GAAAGCACCC GCCGGCCCCG CCCTCGAACC CTTCCTCACC
TGGCAGACCA CCACCCAGAC CGGCAGCCCT CCCAGCACCG ACCCACCCCA GGACGGCCCA
CCCGACGGGC CCGAGCCCTA A
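
A GC content value such as the one in the Gene Information section can be recomputed from a nucleotide sequence printed in this layout (blocks of ten bases separated by spaces). The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, used here only as a short example.
print(gc_percent("GTGTCTGTTC TGTCTGTCAC"))  # 50.0
```

Passing the full gene sequence instead of the two example blocks gives the figure to compare with the GC content field.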
 
Protein sequence
MSVLSVTDDV TEVAYWRGRA ERAEECAEKA EARVGQLQLR VEELSEQVAV LSRMLFGRSS 
EKTGPSSAVD EKPEDRQDSG GGDAGRPARQ RGQRPGSRGH GRRDYSHLQT REEIHDVPEV
DRACPGCGVA FTPLGTDDSE QVDWQVVITR IVHRRRRYRR CCTCPGPRTV TAPVPPKPIP
KGRFTAGFLA RLLYEKYVLG LPLHRIARAL AADGLGVAEG TLCGALKDVH GLLGGLDEQI
VARNAAAGHV HADETTWRVF ERVEGKDGTR WWLWVFVAAD TVVFRMDPTR SAAPVEKHFG
IDRAAGALSD GRRLVVSSDF YTVYQSLGRV DGVDPLWCWA HIRRYFIRAG DAHPQLRYWA
DQWVARIGML YLAHRALAAE QPTTGGYREA AGAFEAALRA IDTARRAEAA IHSLHPAAKK
VLATLDREWD GLARHQDFPD LDLDNNAAER ALRTPVVGRK NYYGAHAEWA AHLAARVWTI
VATAERNGRE PLAFLTGYLN ACATAGGKAP AGPALEPFLT WQTTTQTGSP PSTDPPQDGP
PDGPEP
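
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code, and reads an alternative start codon such as GTG as methionine at the first position. The `START_CODONS` set is a deliberately small subset of common bacterial start codons, and all names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional T, C, A, G codon ordering (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the most common bacterial start codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            amino = "M"  # the initiator codon is read as methionine
        else:
            amino = CODON_TABLE[codon]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("GTGTCTGTTC TGTCTGTCAC CGATGATGTC"))  # MSVLSVTDDV
```

The output matches the first block of the protein sequence, which begins with M even though the gene starts with GTG rather than ATG.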