Gene Francci3_2078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2078 
Symbol 
ID3905605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2444291 
End bp2446024 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content75% 
IMG OID637879414 
Producttransposase IS66 
Protein accessionYP_481180 
Protein GI86740780 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0408516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.686964 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGCGGT GCGTGACGGT TGTCGAGTCG GGGGCGGGCG CTGCCGCGAG CGGTGAGGTT 
GCCGAGGGCG CGGCGCTGCT GGCGGAGAAC GCCTGGCTGC GGGCCCGGGT CGCGGAGCTG
TTGACGGACA TCGCCGGGCT GGTCGCGCGG GAGGCGACGC GTGAGGCCGA GGTGGTGGAG
CTGCGTCTCC AGCTCGAGGC GTTGCAGGCG GAGCTGGCGA CGTTGCGGCG GATGCTGTTC
GGCCGGTCGT CGGAACGGGA GTGCGGCGGG TCGCCGGCCG TGGGTTCGCC GGATGGCGGG
GACGGTTGTG GCGACGGGGC GCGGGGCGAG GCCGCCGGGT CGGCAGGCCG GCGGCGGGGG
CCGGGCGCGC GCTCGGGCCG GCGGAGCTAC GACCATCTGT CCCGCGACGA GGTCGACTGC
GACTTCGAGG GCGGGGGCTA TGGCTGCCTG TCGTGTGGGC AGCCGTTCAC GCCGTGGGGC
GAGCATGTCG TCGAGCAGCT CGACTGGCTG GTGACGGTGC GGGTTCGGGT GTCGAGGCGG
CGCCGGTATC GGCGGGGCTG CCGCTGTGGC GGGTCGTTGA CGGTGACCGC GCCGGGACCG
TCGAAGGCGA TCGGGAAGGG CCTGTTCACG CACCGGTTCC TCGCGATGCT GATCGTGGAG
CGCTATGTCG CGGGCCGTTC GCAGAACTCG CTGGTCACCG GGTTGGCCCG GCACGGCGCC
CAGCTCTCGC CGGCGACGCT GACCGGGGCG TGCGCCCAGG TCGCGGGCCT GCTCGCCCCA
CTCGCCGAGC AGATCGTCGG GCGGTCGCGG GGGTCGTGGC ACCTGCACGC CGACGAGACG
ACCTGGCGGG TGTTCACCCC GACCGGCGGC GGCGGGCCGG CCCGCTGGTG GCTGTGGGTG
TTCCTGGGGC CGGACAGCGT CTGTTTCGTG ATGGACGCGA CCCGCTCGAC GGCGGTGCTC
GCCGAACACG TCGGCCTCGA CCCGGACAGC GGCCAGCTGA CCGACGACGC CGACGGCGGA
CCGCGCCGCC TCGTGCTGTC GTCGGACTTC TACACCGTGT ACGTCTCCGC CGGCCGCCGC
GCCGATGGCC TGGTCAACCT GTACTGCTGG GCGCACGCGC GGCGGTACTT CGTGCGGGCC
GGCGACGCGA ACCCCGCCCA GCTCGGGATC TGGGCCCGCC AGTGGGTCGA GCGGATCCGC
GCGCTCTACA CCGCGCACGG CGAGCTCGCC GCCGCCTGGC ACACCGCCGC CGCGGCCCCG
TCGCCGGCCA CCGAGAAGCG GCTCGCCGCC GCGTACGCCG GCTGGGACAC CGCGATCACC
GTGATCGACA CGGTTCGCCG CGAGCAGACG GCCTCGCCCG GCCTGCAGGA ACCCGCGCGC
AAGGCGCTCG CGACCCTGGA CCGGGAATGG GACGGGCTGG TCGCCCACCG CGACTACCCC
ATGATCGGCA TGGACAACAA CCCGGCGGAA AGGGCGATCA GGGGCCCGGT CGTGACCCGG
CGCAACGCCG GCGGCTCCCG CACCGAGGAC ACCGCCCGCC ACGCCGCCAC GATCTTCACG
GTCACCGCGA CCGCCGCGAT GCACAACCTG AACCTGCTGA CCTACCTGGA GAACTACCTC
GACGCCTGCG GCCGGGCCGG CGGCAAGCCG CCGACCGGCG CCGACCTCGA CCGGTTCCTG
CCCTGGGCCG CCAGCCCCGA GGACCTCACC ACCTGGCAAC AGCCTCCCGG CTGA
 
Protein sequence
MLRCVTVVES GAGAAASGEV AEGAALLAEN AWLRARVAEL LTDIAGLVAR EATREAEVVE 
LRLQLEALQA ELATLRRMLF GRSSERECGG SPAVGSPDGG DGCGDGARGE AAGSAGRRRG
PGARSGRRSY DHLSRDEVDC DFEGGGYGCL SCGQPFTPWG EHVVEQLDWL VTVRVRVSRR
RRYRRGCRCG GSLTVTAPGP SKAIGKGLFT HRFLAMLIVE RYVAGRSQNS LVTGLARHGA
QLSPATLTGA CAQVAGLLAP LAEQIVGRSR GSWHLHADET TWRVFTPTGG GGPARWWLWV
FLGPDSVCFV MDATRSTAVL AEHVGLDPDS GQLTDDADGG PRRLVLSSDF YTVYVSAGRR
ADGLVNLYCW AHARRYFVRA GDANPAQLGI WARQWVERIR ALYTAHGELA AAWHTAAAAP
SPATEKRLAA AYAGWDTAIT VIDTVRREQT ASPGLQEPAR KALATLDREW DGLVAHRDYP
MIGMDNNPAE RAIRGPVVTR RNAGGSRTED TARHAATIFT VTATAAMHNL NLLTYLENYL
DACGRAGGKP PTGADLDRFL PWAASPEDLT TWQQPPG