Gene Francci3_1089 details

Gene Information

Locus tag: Francci3_1089
Symbol:
ID: 3905760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1301636
End bp: 1302808
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 72%
IMG OID: 637878422
Product: transposase, IS4
Protein accession: YP_480199
Protein GI: 86739799
COG category: [L] Replication, recombination and repair
COG ID: [COG5659] FOG: Transposase
TIGRFAM ID:
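
The coordinate and length fields above agree with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon (neither is stated by the record itself). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length_bp(1301636, 1302808)
print(length, protein_length_aa(length))  # 1173 390
```

Both results match the listed Gene Length (1173 bp) and Protein Length (390 aa).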


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCG CTATCGGGCG CTCTTATGCA ACGGCTGCTG ACCGACGGCA AGCGCAAGAG 
CATGGTCCCG ATGGCCGCCC GCCTCGGGGT GGACCCTCAG CAGCTGCAGC AGTTCGTCAC
CAGCTCGACC TGGGACTACC GCCAGGTCCG GCGGCGGCTG ACGGGCTGGG CGGCCGGGTT
CTGCGACCCG GTCGCGCTGG TCGTGGACGA CACCGGCTTT CCCAAGGACG GGCCGCCTCC
CCCGGGGTGG CCCGGATGTA CTCCGGCACC CTGGGCAAGG TCGGGAACTG TCAGATCGGG
GTGTCCGTGC ACGCGGTCAC CGACTGGGCG TCAGCCGCGG TGGCCTGGCG GCTGTTCCTC
CCGACCTGCT GGGACGACAC CACCCTGACC GACCCGACCG AGGTCGCCGC CGCCCGGGCC
CGGCGGGAAC GGGCCGCGAT CCCCGACAAG GCGCGGCACC GGGAGAAATG GCGGCTCGCC
CTGGACATGA TCGACGAACT GGCCGGCTGG GGCATGCCTG TGCGGCCGGT CGTGGCGGAC
GCCGGCTACG GCGACGCCGC TGCCTTCCGC CAGGGGCTGA CCGACCGGAA CATCCCCTAC
GTGCTGGCGG TGAAGCCGAC CGCGACCGCC TACCCGGCCG ACGCGGTGCC GGTCACCGCC
CCCTACCCGG GAAACAGCCG GCGGCCCACA CCCGCCTACC CCGACCCGCC CCGCGATCTG
AAATCCCTGG TCATGGCCGC CGGCCGCCGC GCCGGCCGGT CTGTGACCTG GCGTCACGGC
ACCCACCGGA CCCCGGCCAA CCCGACCGCG GGGATGCGGT CCCGCTTCCT CGCGCTCCGG
GTCCGCCCCG CGGGCCGGAA CATCACCCGT AACCCCGACC GGAGCCTGCC CGTCTGCTGG
CTGCTCGCCG AATGGCCAGT CGGCCAGCCC GAACCCACCG ATTACTGGCT GTCCACCCTG
CCCACCGGCA TCCCCCTGCG CGATCTTGTC CGTCTCGCGA AGATCCGCTG GCGGATCGAA
CACGACTACC GCGAGTTGAA AGACGGCCTC GGCCTCGACC ACTTCGAGGG CCGAACCTTC
GCCGGCTGGC ACCGTCACGT CACCCTCGTC AGGGTCGCCC AAGCCCTCTG CACCCAGCTG
AGACGAACCC CAAAAGTCCC TGCGCCGGCC TGA
 
Protein sequence
MSAAIGRSYA TAADRRQAQE HGPDGRPPRG GPSAAAAVRH QLDLGLPPGP AAADGLGGRV 
LRPGRAGRGR HRLSQGRAAS PGVARMYSGT LGKVGNCQIG VSVHAVTDWA SAAVAWRLFL
PTCWDDTTLT DPTEVAAARA RRERAAIPDK ARHREKWRLA LDMIDELAGW GMPVRPVVAD
AGYGDAAAFR QGLTDRNIPY VLAVKPTATA YPADAVPVTA PYPGNSRRPT PAYPDPPRDL
KSLVMAAGRR AGRSVTWRHG THRTPANPTA GMRSRFLALR VRPAGRNITR NPDRSLPVCW
LLAEWPVGQP EPTDYWLSTL PTGIPLRDLV RLAKIRWRIE HDYRELKDGL GLDHFEGRTF
AGWHRHVTLV RVAQALCTQL RRTPKVPAPA
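
The protein sequence is the conceptual translation of the gene sequence. As a rough illustration, the Python sketch below translates the first 60 bases of the gene sequence and provides a helper for GC percentage. It assumes the standard codon assignments, which translation table 11 shares for internal codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and spaces in the input are treated as formatting only.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAGCGCCG CTATCGGGCG CTCTTATGCA ACGGCTGCTG ACCGACGGCA AGCGCAAGAG"
print(translate(first_row))  # MSAAIGRSYATAADRRQAQE
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare against the listed GC content.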