Gene Francci3_3364 details


Gene Information

Locus tag: Francci3_3364
Symbol:
ID: 3905946
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3993078
End bp: 3994181
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 74%
IMG OID: 637880687
Product: alcohol dehydrogenase GroES-like protein
Protein accession: YP_482448
Protein GI: 86742048
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
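
The start, end, gene length, and protein length fields can be checked against each other with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. Both are assumptions, not statements made by the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3993078
end_bp = 3994181

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1104 0 367
```

Under these assumptions the computed values agree with the listed gene length of 1104 bp and protein length of 367 aa.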


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0600954
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGGT CGCGCGTTCT TCGTCCACCG GCTGGGCGAC AGCCCGAATC CGGACGCATG 
ATCATGACCG ATGACGACAT GCTGGCCCTG GTGGTCCGGG GGCACCGTGA CCACGGCCTG
GAGCGGCGTC CCCGGCCCGA GCCCGGTCCG GGCGAGGCCC TGGTCGCGAC CGACTTCGCC
GCCATGTGCG GTACCGACCT GCGGTTGCTG GACGGGACCC TGCACGACGC CGAGTACCCG
GTGATTCCCG GGCACGAGTG GTCCGGCACG GTGCTGGCGG CCCCCGACCG CCCCGAGCTG
GTGGGCCGGG CGGTGGTGGG CGACAACTTC CGGCTCTGCG GCCGGTGCCC CGCCTGCCTC
GCGGGACGGC CGAACCTGTG CACCGACATC GACGAGGTGG GCTTCACCCG CCCGGGTGCC
TTCGCCCAGC TGTTCACCAT CCCCGCCGCC AACCTGGTCG CGCTGCCGCC ACAGGTCCCG
GGCCCCCAGG CCTGCCTGCT GGAGCCGCTC GGCGTGGCCC TGCACGCCGT GGAACGGGCC
GGGGCGGTGT CCGGCCGGTC GGTCGGCGTG ATCGGCGCCG GAACGATCGG CCTGCTCGTC
GCCCAGCTGG CCCGCGGCGC CGGGGCGTCC CGGGTACGGG TGGCCGATCC CCTGCAGTCC
CGCCGGCGGA TCGCCGCCGA CCTCGGCGTC GACGCCGACC CGGGTATCGA GGGCTGGCAC
CCGGACCTGC CGGAGGTTGT CTTCGACGCC ACCGGAGCGG CCGGCGTGTT CCCCCGTGGC
CTGACGGCGA CCGCGGTCGG CGGCGTCTAC GTGCTCGTGG GCTACTCGGG AGCCGAGGCG
GTCACCGTCG AACCGAGCAC GGTGATGCTG CGCGAACTGA CCGTGCAGGG TGTGCTGTCC
GGTCAGGGGC AGCTGCGCAC CGCCCTGGCC AAGGTCGTTG CCGGCGAGGT CCGCCTGGGT
CCCCTGACCG GCGACCCGGT CCCGCTGACC TCGTACCGCA GTGTGCTCGA GCGCGACGGA
CCGGCACCGT TGCGCCTCTT CTTCCACGTC GGCGGCGACC GGACGGCCGG CGCCACCAGC
CAGGAGCAGG GGGTAACGGC ATGA
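
A GC content figure such as the one listed in the gene information can be recomputed from the sequence text. The following is a minimal sketch: `gc_fraction` is a hypothetical helper name, and it simply ignores the spaces and line breaks used to group the bases in blocks of ten. Only the first line of the gene sequence is used here for illustration, so the printed value describes that excerpt and not the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (the 10-base grouping and line breaks used in this
    record) is removed before counting.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above (60 bases), used as an excerpt.
first_line = (
    "ATGGCACGGT CGCGCGTTCT TCGTCCACCG "
    "GCTGGGCGAC AGCCCGAATC CGGACGCATG"
)

print(f"GC fraction of excerpt: {gc_fraction(first_line):.1%}")
```

Passing the full gene sequence to the same function would give the value to compare with the GC content field.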
 
Protein sequence
MARSRVLRPP AGRQPESGRM IMTDDDMLAL VVRGHRDHGL ERRPRPEPGP GEALVATDFA 
AMCGTDLRLL DGTLHDAEYP VIPGHEWSGT VLAAPDRPEL VGRAVVGDNF RLCGRCPACL
AGRPNLCTDI DEVGFTRPGA FAQLFTIPAA NLVALPPQVP GPQACLLEPL GVALHAVERA
GAVSGRSVGV IGAGTIGLLV AQLARGAGAS RVRVADPLQS RRRIAADLGV DADPGIEGWH
PDLPEVVFDA TGAAGVFPRG LTATAVGGVY VLVGYSGAEA VTVEPSTVML RELTVQGVLS
GQGQLRTALA KVVAGEVRLG PLTGDPVPLT SYRSVLERDG PAPLRLFFHV GGDRTAGATS
QEQGVTA
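
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares for elongation codons (table 11 differs from the standard code in its permitted start codons). `translate` and `CODON_TABLE` are illustrative names, and the function assumes the input starts in frame and contains only A, C, G, and T.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying
# fastest, paired with their amino acids ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGGCACGGT CGCGCGTTCT TCGTCCACCG"))  # MARSRVLRPP
print(translate("CAGGAGCAGG GGGTAACGGC ATGA"))        # QEQGVTA
```

The two excerpts reproduce the first ten and last seven residues of the protein sequence listed above.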