Gene Francci3_2944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2944 
Symbol 
ID3903759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3477679 
End bp3479199 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content67% 
IMG OID637880265 
Productaldehyde dehydrogenase 
Protein accessionYP_482031 
Protein GI86741631 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.225427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTACG TCCCACCTGG AAAGCCAGGC AGTATCGTCC ATGTTACGGA CCGGTACGAG 
AACTTCATCG GTGGAAAGTG GCTGCCGCCG CGGGCCGGTA AGTATTCGAC CAACCTCAGC
CCGGCCACGG CGCAGCCGAT CTGTGAAATA CCGCGGTCGG CGGCGGAGGA TGTCGAGGAC
GCCCTGGACG CGGCGCACGC CGCCGCCGGC TCCTGGGCCG CCGCGTCCCC GGCGGAGCGG
GCCGAGGTGC TGACCGCGGT GGCGGATGCG ATCGACGCCA ACCGGGAGAT GCTCGCGGTC
ACGGAGAGCT GGGAGAACGG CAAGCCGGTG CGGGAGACGC TGGCTGCCGA CATCCCGCTG
GCCGCCGACC ACTTCCGGTA CTTCGCCGCG GCGGCGCGCA CCCTGGAGGG GTCGATCTCC
GAGATTGACG GCAGGACCTA CGCCTATCAC TTCCACGAGC CGCTCGGGGT GGTCGGGCAG
ATCATCCCCT TCAACTTCCC GTTGCTGATG GCGGCGTGGA AGCTGGCGCC CGCGCTGGCC
GCGGGTAACT GCTCGGTGAT CAAGCCGGCG TCACCGACGC CGTGGTCGAT CCTCAAGCTC
GCCGAGGTGA TTCAGGACGT CATCCCGCCC GGCATCATCA ACATCGTCAA CGGGCCCGGC
GCCGAGGTCG GCAGAGCCCT GGCCACCAGC CCGAAGATCG CGAAGATCGG GTTCACCGGT
GAGACCACGA CCGGCCGGCT GATGATGCAG TACGCGGCGC GGAACATCAT CCCGGTCACC
CTGGAGCTGG GCGGAAAGTC CCCGAACATC TTCTTCGAGG ACGTCCTGGC CGCCGACGAC
GCCTATCTGA ACAAGGCGGT CGAGGGGCTG GTGCTGTATG CGTTCAACAA GGGCGAGGTG
TGCACCTGCC CGTCCCGGGC GCTGATCCAG GAGTCGATCT ACGACGAGTT CATGGCGCGG
GCGCTGGAAC GGGTCGGCCG GATCCAGCAG GGCAACCCGT TGGACCCGGC GACAATGCTG
GGCCCGCAGG TCTCCGCGCA GCAGCTGTCA AAGATCGGCT CCTACGTGGA CATCGGCCTG
GCTGAGGGCG CCGAACTGCT GGCCGGTGGG CACCGGACGC GGCTGGCCGG AGAGTTCGCG
AACGGCTACT TCTTCGAGCC GACGCTGCTG AAGGGCCACA ACAAGATGCG GGTCTTCCAG
GAGGAAATCT TCGGCCCGGT GCTGGCTGTC ACCACCTTCA AGGACGAGGC CGAGGCGTTG
GCCATCGCCA ACGACACGCC TTACGGGCTC GGTGCCGGTG TGTGGACGCG GGACGGTGCC
CGCCAGTTCC GGATGGGCCG GGGCATCAAG GCCGGCCGGG TCTGGGTCAA CTGCTACCAC
CAGTATCCGG CGGGCGCGGC ATTCGGCGGA TACAAGGTGT CCGGCATCGG CCGCGAAAAC
CACCGGATGA TGCTGGAGCA CTACAGCCAG ACCAAGAACA TGCTGGTCAG CTACGACGAG
AACGCGCTCG GCCTGTTCTA G
 
Protein sequence
MTYVPPGKPG SIVHVTDRYE NFIGGKWLPP RAGKYSTNLS PATAQPICEI PRSAAEDVED 
ALDAAHAAAG SWAAASPAER AEVLTAVADA IDANREMLAV TESWENGKPV RETLAADIPL
AADHFRYFAA AARTLEGSIS EIDGRTYAYH FHEPLGVVGQ IIPFNFPLLM AAWKLAPALA
AGNCSVIKPA SPTPWSILKL AEVIQDVIPP GIINIVNGPG AEVGRALATS PKIAKIGFTG
ETTTGRLMMQ YAARNIIPVT LELGGKSPNI FFEDVLAADD AYLNKAVEGL VLYAFNKGEV
CTCPSRALIQ ESIYDEFMAR ALERVGRIQQ GNPLDPATML GPQVSAQQLS KIGSYVDIGL
AEGAELLAGG HRTRLAGEFA NGYFFEPTLL KGHNKMRVFQ EEIFGPVLAV TTFKDEAEAL
AIANDTPYGL GAGVWTRDGA RQFRMGRGIK AGRVWVNCYH QYPAGAAFGG YKVSGIGREN
HRMMLEHYSQ TKNMLVSYDE NALGLF