Gene Francci3_2784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2784 
Symbol 
ID3904930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3277160 
End bp3278905 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content74% 
IMG OID637880106 
Productaldehyde dehydrogenase 
Protein accessionYP_481872 
Protein GI86741472 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCCGA CCCCGCAGAA CCCGCAGAAC CCGCAGAACC CGCAGACCGT AGCGAACCGG 
GAGACCGGGG TGACTGAGGG GGTCGGTCCG GCGGTCTACC CGGCGGCCGG CATCGGGACG
CTCCTGTCGA CGGATCCGGC CACCGGCGCG GTGGTCGCGA GCTTCCCGGT GATGGAGCCG
ACCCAGGTGC GGGCGGCCGT GGCGTCCGCC CGGTCGGCGG CGGGCTGGTG GGCGGGGCTC
GCGGCGGCAC GGCGGCGCGA CCATCTGCTG CGCTGGGCGG CGCACCTCGT CCGTCACGAG
ACCGAGCTGA TCGACCTCCT GCACGCCGAG AACGGCAAGA CCGCCGCCGA TGCCCGCATC
GAGCTGCTGC TGACGTTGGA ACACATCCGC TGGGCGGCCC GCAACGCCGC CCGCGTGCTG
CGGACCCGCC GGGTGTCGCC CGGCCCGTTC CTGGCGAACC ACACCGGACG GATCGAGTAC
CGCCCGTTCG GGGTCGTCGG GGTGATCGGG CCGTGGAACT ATCCGCTGTT CACCCCGAGC
GGTTCGATCG CGTATGCCCT GGCAGCCGGT AACACGGTGG TCTTCAAGCC CAGCGAGTAC
ACCCCGGCGG TGGGCGCGTA CCTGGTCGCG GCGTTCGCCG CGGCGAATCC GGACGCGCCG
GCCGGGGTCC TGACCATGGT CACCGGCTTC GGTCCGACCG GTGCCGCCCT GTGTACAGCC
GGGGTCGACA AGATTGCCTT TACCGGTTCG CCCGCGACCG GCCGCCTGGT CATGGCGGCC
TGCGCGTCGT CCCTCGTCCC GGTGGTGATC GAATGCGGCG GCAAGGACCC GCTGATCGTC
GCCGACGACG CTGACGTGGC CGCCGCCGCG CGGGCAGCGG CCTGGGGAGC GATGTCCAAT
GGCGGCCAGA CCTGTGCCGG GGTTGAGCGG ATCTACGTGA CCGAGGCGGT CGCCGCGCCC
TTCCTCGCCG CACTGCGCCG CGAGCTCGAC GGGGTTCGTC CCGGTGCCGA CCGGGACGCA
TCCTACGGTC CGATGACGAT GCCCGGCCAG GCGGCGATCG TGCGCCGCCA CGTGGCTGAC
GCGCTGGCCC GCGGCGCGAC CGCCCTGATC GGCGGGACGG AGTCGGTGGG GGACACCTTC
ATCGAACCGG TCGTGCTGGT CGACGTGCCG GAGGGCAGCC CCGCCGTGCA GGAGGAGACG
TTCGGGCCGG TCGCCACCGT CCGGACCGTC GCCGACGTTG ACGAGGCCGT CACGCTGGCC
AACGGCACCC CGTACGCGCT CGGTGCGACG GTTTTCTCCC GGTCCCGGGG AGACGAGATC
GCCAGCCGGC TCGACGCTGG GATGGTGTCC GTCAACGCGG TGCTGGCGTT TGCCGGGATG
CCCGCCCTGC CCTTCGGGGG CAGCGGCGAA AGCGGGTTCG GCCGGGTGCA CGGCGCGGAG
GGGCTGCGCG AGTTCGTCCG CCCCCGCTCG GTCGCCACCC TGCGGATGCA CGTGCCGGGA
GCGACCCTGA CCACCTTCCG CCGGACGCCG GGTGCGCTCG CCGTCACCGC GGTGACGGCG
CGGCTCCGGC ACGGCGGCGG CTGGGCCGGC TGGGCCAGCC GGGTGGCCGG GCGGGTCAGG
CCGGTCGGCG GGGCGGCCGG GCCTCTCGAC AGGCCCCTCG GCGGGCACGT CGGGTCGGAC
GAGGGATTCC GTCCCGGCCG GCTCCGGCTC CGAAAATCAT CCGGAAGTCG TCCAGACGGA
CGGTGA
 
Protein sequence
MRPTPQNPQN PQNPQTVANR ETGVTEGVGP AVYPAAGIGT LLSTDPATGA VVASFPVMEP 
TQVRAAVASA RSAAGWWAGL AAARRRDHLL RWAAHLVRHE TELIDLLHAE NGKTAADARI
ELLLTLEHIR WAARNAARVL RTRRVSPGPF LANHTGRIEY RPFGVVGVIG PWNYPLFTPS
GSIAYALAAG NTVVFKPSEY TPAVGAYLVA AFAAANPDAP AGVLTMVTGF GPTGAALCTA
GVDKIAFTGS PATGRLVMAA CASSLVPVVI ECGGKDPLIV ADDADVAAAA RAAAWGAMSN
GGQTCAGVER IYVTEAVAAP FLAALRRELD GVRPGADRDA SYGPMTMPGQ AAIVRRHVAD
ALARGATALI GGTESVGDTF IEPVVLVDVP EGSPAVQEET FGPVATVRTV ADVDEAVTLA
NGTPYALGAT VFSRSRGDEI ASRLDAGMVS VNAVLAFAGM PALPFGGSGE SGFGRVHGAE
GLREFVRPRS VATLRMHVPG ATLTTFRRTP GALAVTAVTA RLRHGGGWAG WASRVAGRVR
PVGGAAGPLD RPLGGHVGSD EGFRPGRLRL RKSSGSRPDG R