Gene Francci3_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2183 
Symbol 
ID3906783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2556883 
End bp2558121 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content72% 
IMG OID637879516 
Producttransposase, IS4 
Protein accessionYP_481282 
Protein GI86740882 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.208464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGCATG ACGGGAGTGA GGGGCTGTCG CCGGAGGGCT GGTTGCCGGA CCGGGTGACG 
GTGGGTGTGT TGACGCGGGT GTACCCACCG GAGCTGGTGG ACCGGGTGTT GGCGGTGACC
GACACCGCGG AGGTTCGCCG CCGGCTGTTG CCGTCGTGGC TGGTGGTGTA CTTCGTGTTG
GCGTTGTGGC TGTTCCGGGG CCGGAACTGT GGCTATGTGC AGGTGCTGGC CCGGTTGACC
AGCGGGCTGC ACTTCCAGCG CCGGGCGGCT GTGTTGGCCG CGGGTGGGGC GGGTGGGGCG
GGTTGGTCGC TGCCGGCAAG CCCGTCGCTG GGCGAGGCGC GGGCACGGAT CGGGTCGGAT
CCGGTGCGGA TGCTGTTCGA GCACGCCGCG GGGCCGGTCG GTGTCGAGGG CCAGGCCGGG
GTGTTCCTGC ACGGCCTGCG GCTGGTGCAG ATCGACGGGT CGACCTGCGA TCTGCCGGAC
ACGCAGGCCA ACCGGGCGTT CTTCCCCGGG CCGTCGAACG CCGGTGGGCC GGCGCCGTTC
CCGAAGGTTC GCTGGGTCAT CGCCGCCGAG GCCGCCACCG GTGCCCTGCT GGGGGCGTCG
TTCGGCCCGT GGAGCACCGG CGAGCCGGCG CTGGCCCGTG ACCTGCTGGG GCAGCTGGGC
CCGGGCATGC TCACGTTGGC GGACCGTAAC TTCCTGTCCC ACCGGCTCGC CGGCGAGGTC
CTGGCGACCG GGGCGCACCT GCTGTGGCGG GCGAAGGCGA CCTTCACGCT GGCCCCGGTC
CACGTCCTCG ACGATGGCAG CTACCTCGCC GAGCTGACAC CTCCCCGCGG AAGCGAGGGA
CCACCACTGA CGATGCGGGT GATCGAGTAC ACCGTGCACT CCACCACCGC CGGCGGCGAC
GAGAGCAGCT CAGAGCTGTT CTGTCTGGTC ACCGACCTGC TCGACCCCGA GGAATGGTCC
ATGCTCGACC TGGCCCGCGC CTACCCGACC AGGTGGGGAT GCGAGACCGT GATCGGCCAC
CACAAGACCG ATCTCGGTGA AGGCCGACCC GTCCTGCGCA GCAAGGACCC CGAAGGCGTC
GCCCAGGAGA TGTGGGCCCT GTTCGCCGTC CACCAGGCAC TCGCCCGCCT CATCGGTGTC
GCTGCCGACA CCACCGGCAC CCCACCCGAC AGGATCAGTT TCCGTCGGGC CCTCACCGCC
GCGTCCGATT CCATCGGGAC CGCGGCTTTC CCCCCCTGA
 
Protein sequence
MGHDGSEGLS PEGWLPDRVT VGVLTRVYPP ELVDRVLAVT DTAEVRRRLL PSWLVVYFVL 
ALWLFRGRNC GYVQVLARLT SGLHFQRRAA VLAAGGAGGA GWSLPASPSL GEARARIGSD
PVRMLFEHAA GPVGVEGQAG VFLHGLRLVQ IDGSTCDLPD TQANRAFFPG PSNAGGPAPF
PKVRWVIAAE AATGALLGAS FGPWSTGEPA LARDLLGQLG PGMLTLADRN FLSHRLAGEV
LATGAHLLWR AKATFTLAPV HVLDDGSYLA ELTPPRGSEG PPLTMRVIEY TVHSTTAGGD
ESSSELFCLV TDLLDPEEWS MLDLARAYPT RWGCETVIGH HKTDLGEGRP VLRSKDPEGV
AQEMWALFAV HQALARLIGV AADTTGTPPD RISFRRALTA ASDSIGTAAF PP