Gene Francci3_1817 details


Gene Information

Locus tag: Francci3_1817
Symbol:
ID: 3906208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2152020
End bp: 2153603
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 72%
IMG OID: 637879155
Product: hypothetical protein
Protein accession: YP_480922
Protein GI: 86740522
COG category:
COG ID:
TIGRFAM ID:
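
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2152020
end_bp = 2153603
gene_length_bp = 1584
protein_length_aa = 527


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a complete CDS ending in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)                 # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)    # True
```

Under these assumptions, 2153603 - 2152020 + 1 gives 1584 bp, and 1584 / 3 - 1 gives 527 aa, which agrees with the listed values.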


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAAGA CCAGCCAGCG CGTCGAGCAG GAAGAGCTGC GGGCGCGGAT GCGCGCGGTC 
GGCATGTCCC ACGACGAGAT CGCGACCGAG TTCGCCCGCC GTTATAGCTA CCGTCCCCGC
GCCGCCCACC GGGTCGCGCA CGGCTGGACC CAGGCGCAGG CCGCTGGGCA CATCAACGCT
CATGCCGCCC GGGTTGGCCT CGACCCGGGC GGCGCCGCTC CGTTGACCGG GCCGGGGCTA
TGCGAACGGG AGCTGTGGCC GCAACCGAAC AACCGCCGCC GGCCCACACC GCAGTTCCTC
GCGCTGCTCG CCGAGGTCTA TGGCACCAGT ATCCACAACC TGATCGACCT CGATGACCGC
GAACAGATGC CCCCAGCAGA CCTGCTCCTC ATCAGCACGA TCCGCGTGGA GGGCGTGCCC
GCGAGCGCTG TCGGTTCCCT TGCGCGCACC AAGGCGGTGG ACGTCGAGGC GCCGCTGATG
ACCGCATTGA ACCGGCAGCA GTTCCTGCTC GCCGTGTCGA TCTCGGGGGT TGCCGCGGTG
ACACCCCGAC AACCGGCCGC GCCTGCCGTG TCGGTAAACA CCAGTAGGGC ACCGGCCGGG
GACGATCTTC TTGCTGATCT GCGGGAGGCC GTCACGGTTC CAGCGGAGTG GTCCGCTGAC
CCTGGCGTGA CCTCGGCGGG TGCCTTTGCT GACCTTGAGG CGCGGGGGCG GGAGTGCCAC
AACCGGTATC AGCGGGCCGA CTATGCTGGG GCGGCGCGGC TCCTTCCCGC GGTGGTGCGG
GGTATCGACA CGCTGGTCGC TGATTCGCCG GCGGGCGTGG ACCACCGCGC GGTCCGGCGG
TCTCAGGCTG TCGCGTACAT CGCTGCCGCC AAGCTCGCGA CCAGGACCGG CGACCACGAT
CTAGCGTGGC TGGCCGCGGA CCGTGGTCAA CACGCGGCGC TCGCCGCCGA CACGCCAGTG
CTTCTGGCGA CCGCACGCAG GCAGATCGCC TGCGTCTTCC ACGACACAGG GCGGCTGGCC
GACGCCGAAC GGGTCGCCCT CAGCGCACTC GACGCCCTGA ACCGGCGGCG AGGCGACGAG
GACCACCCCG ACATCGTTTC CGCGCGGGGC GCCCTTCTCC TGCTCGCGGC GATGACCTCG
GTCCGTCAGG GTGAGCGGGC GCAAGCCCGC CGCCGGCTCA CCGCCGCGGC CGAGCAGGCG
GGCACGCTGG GCCGGGACGA CAACCGGTTG TGGTCGGCGT TCGGGCCGAC GAACGTGGCG
ATCCACACCC TCACCGCGGC CCTGACGCTC GACGATCCGA CGGAGGCGGT CGCCGTCGGC
GAACAGATCG ACACGCGTCT GCTGCCTCCC CCGCTGGTGG GCAGGCGTGC GCGATTGCAC
CTAGATCTTG CCGACGGGCA CGCCCGCCTG GGCGAGGACG CCGTCGCGGC CGTGCACATC
CTCGACGTCG CCCGGCGAGC TCCACAGCTG CTGCGGGTCG ACCCGACCGC TCGGGCGGTG
CTGGCGACGC TGCTGAGCCG CGCCCGCGGT TCCATCGTCT CTGTCCTGCG GGACGTCGCC
GAGCAGGCCG GAGTCGCGGC GTGA
 
Protein sequence
MGKTSQRVEQ EELRARMRAV GMSHDEIATE FARRYSYRPR AAHRVAHGWT QAQAAGHINA 
HAARVGLDPG GAAPLTGPGL CERELWPQPN NRRRPTPQFL ALLAEVYGTS IHNLIDLDDR
EQMPPADLLL ISTIRVEGVP ASAVGSLART KAVDVEAPLM TALNRQQFLL AVSISGVAAV
TPRQPAAPAV SVNTSRAPAG DDLLADLREA VTVPAEWSAD PGVTSAGAFA DLEARGRECH
NRYQRADYAG AARLLPAVVR GIDTLVADSP AGVDHRAVRR SQAVAYIAAA KLATRTGDHD
LAWLAADRGQ HAALAADTPV LLATARRQIA CVFHDTGRLA DAERVALSAL DALNRRRGDE
DHPDIVSARG ALLLLAAMTS VRQGERAQAR RRLTAAAEQA GTLGRDDNRL WSAFGPTNVA
IHTLTAALTL DDPTEAVAVG EQIDTRLLPP PLVGRRARLH LDLADGHARL GEDAVAAVHI
LDVARRAPQL LRVDPTARAV LATLLSRARG SIVSVLRDVA EQAGVAA
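
The GC content and the protein sequence listed in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch using only the Python standard library, not the method used by the database. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code; table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
# Codon table for NCBI translation table 11 (same amino-acid assignments as
# the standard code). Bases are ordered T, C, A, G in each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (input must be non-empty)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_bases = "ATGGGCAAGA CCAGCCAGCG CGTCGAGCAG"
print(translate(first_bases))  # MGKTSQRVEQ
```

The printed result matches the first ten residues of the protein sequence. Passing the full gene sequence text to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.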