Gene Francci3_1822 details

Gene Information

Locus tag: Francci3_1822
Symbol:
ID: 3906213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2159649
End bp: 2160875
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 72%
IMG OID: 637879160
Product: hypothetical protein
Protein accession: YP_480927
Protein GI: 86740527
COG category:
COG ID:
TIGRFAM ID:
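
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
START_BP = 2159649
END_BP = 2160875


def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive coordinate range."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = span_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1227 408
```

Both results agree with the listed Gene Length (1227 bp) and Protein Length (408 aa).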


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.961315
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCAG GCACCGCGGA AGACACCGGC CGCGACACGC TCTCCAGCCG GGCACTGGCC 
GCCTCAGCGG CAATCGCGGA CCAGCTCGCC GACCCCGCCG CCGTTCCCCG AGGCCCTGGC
CGGCGGCGAG GCCAGTCCCT CGCGGGCGGT GCAGCCGGCA TCGCGCTGCT CCACCTCGAA
CGAGCCCGCA CCGGCCACGG CGACCCCGCC ATCGCCAACG CCTGGCTACG GGCGGCAACC
CGAGACCCAG CCAGCGCAGG GCCCAACGCC TCCCTCTACT TCGGCGCCCC CACCCTCGCC
TTCGTACTCG ACGCCGCAGG CCGCCCAGAC CAGCTCACCC AAGCGGTCAC CACACTCGAC
ACCGCGACCA TCGCGGTCAC ACAGCGACGA CTTGCCGCGG CTGACGCCCG CCTCCGCGCG
GGCCTCCGCC CGCCGCTCGC CGAATTCGAC CTCATCCGCG GCCTCGCCGG CCTCGGCCGC
TACCACCTAC GCCGCCAGCA CCCGATCATC ACCGATGTCC TGACCCACCT CGTCCGTCTC
ACTCAGCCCC CTGCCAGCGG AGACGGACTA CCCGGGTGGT GGACCGACCT CGACCCCAGC
GGCGCATCCT CCCGTGACTA CCCACACGGT CATGGAAATG CCGGAATGTC CCACGGCATC
GCCGCCTGCC TCGCCCTGCT CGCACTAGCC AACAGCCGCG GCACCGAGGT CGACGGCCAC
CGGGAAGCGA TCGAACGGAT CTGCGCCTGG CTCGACGGGC ACCAGCAGCC TGGCATCGCT
GGCTGGCCGG GCATCGTCAC CCCGACACCG GGCCATGACG TCAGGCAGCA GCGGTTGTCC
TGGTGCTACG GCACCCCAGG GATTGCCCGT GCGTACCAAC TCGCTGGACT CGCCACCAGT
GACCCGAGGC GCTGCGAGAA GGCCGAAACC GCGTTGCACG CATGTCTCGA CGACGCGGCT
CGCCTCGAAC TGACCACCGA CATCGGCTTG TGCCACGGCC TGGCAGGGCT GGTCCACACG
ACATCCCGGG TCGCGGCCGA CGCCACCACA CCCGAACTCG CCCAGCTCCT GCCCACCCTC
GTCGCCCGGC TGCTCGACCA ATACCCCACC GCGCCGCACG ACCCGGAGCT TCTCGATGGC
TTGGCCGGGG TGGCGCTTGC CCTGCACACC GCGGCCCTCG GCTCCGCCCC CGTGACCGGG
TGGGACGCCG CCTTGCTCCT CGCGTGA
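
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as the GC content can be recomputed. A minimal sketch, assuming the sequence is pasted in as a plain string; `gc_percent` is an illustrative helper, not a function of any particular library, and the example runs on the first two blocks only.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence above, used only as a small example.
first_blocks = "ATGAATCCAG GCACCGCGGA"
print(gc_percent(first_blocks))  # 60.0
```

Passing the complete gene sequence to the same function gives a figure that can be compared with the GC content field in the Gene Information section.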
 
Protein sequence
MNPGTAEDTG RDTLSSRALA ASAAIADQLA DPAAVPRGPG RRRGQSLAGG AAGIALLHLE 
RARTGHGDPA IANAWLRAAT RDPASAGPNA SLYFGAPTLA FVLDAAGRPD QLTQAVTTLD
TATIAVTQRR LAAADARLRA GLRPPLAEFD LIRGLAGLGR YHLRRQHPII TDVLTHLVRL
TQPPASGDGL PGWWTDLDPS GASSRDYPHG HGNAGMSHGI AACLALLALA NSRGTEVDGH
REAIERICAW LDGHQQPGIA GWPGIVTPTP GHDVRQQRLS WCYGTPGIAR AYQLAGLATS
DPRRCEKAET ALHACLDDAA RLELTTDIGL CHGLAGLVHT TSRVAADATT PELAQLLPTL
VARLLDQYPT APHDPELLDG LAGVALALHT AALGSAPVTG WDAALLLA