Gene Francci3_0541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0541 
Symbol 
ID3904192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp626964 
End bp628412 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content69% 
IMG OID637877870 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_479654 
Protein GI86739254 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.482391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.20204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCA ACACGTCGAC TTCCTCCACC ACGGACGATC TGACCACCGG GGCTCCCAAC 
GGCACCGGGG CCCCCGACGG CGCGAACGGC GTCGGGGGCC CGACCGGGAC CGTCGGCGGA
CCCGGGGAGC ATCCGGCCTA CGAGGCCGGC TTCACCGAGT CGGCGAACGG GCGGGTCTAC
ACCGTCACCG GCAGCGACTG GGAGCAGATC CTCGGCGTCG GCGAGGAGGA GAACGAGCGG
ATCGTCGTCA ACATGGGGCC GCAGCACCCG TCGACCCACG GGGTGCTCCG CCTGGTCCTG
GAGATCGAGG GCGAGACGGT CACCGAGACC CGCCTCGTCA TCGGCTACCT GCACACCGGC
ATCGAGAAGA GCTGTGAGTA CCGCACCTGG ACTCAGGCGG TCACCTTCCT CACCCGGGCG
GACTACCTCT CGCCGCTGTT CAACGAGGCG GCCTACTGCC TGTCGGTGGA GAGGCTGCTG
GGCATCACCG AGCAGGTACC CGAGCGGGCC ACGGTGATCC GGGTGATGGT GATGGAGCTC
CAGCGGATCG CCTCGCACCT GGTGTGGCTC GCGACCGGCG GCATGGAGCT CGGCGCCACC
ACCGCCATGA TCTTCGGTTT CCGGGAGCGG GAGAAGGTCC TCGACCTGCT CGAGCTCATC
ACCGGGCTGC GGATGAACCA CGCCTACATC CGGCCCGGGG GCCTCGCCCA GGATCTCCCC
GACGGCGCCG AGCGGGCCAT CCGGGCGTTC CTCGCGGACA TGCCGAAGCG GATCAGGGAG
TATCACGCGC TGCTCACCGG CCAGCCAGTC TGGAAGGCCC GGATGGTCGA CGTCAACGTT
CTCGACGCGG CCGGCTGCAT CGCGCTGGGG ACCACGGGCC CGGTGTTGCG CGCCGCGGGC
CTGCCGTGGG ACCTGCGCAA GACCATGCCC TACTGCGGCT ACGAAACCTA CGAGTTCGAC
GTGCCGACCG CGCTGGAGGG CGACTCCTTC GCCCGCTACC TGGTGCGGCT GGAGGAGATG
GGCGAGTCAC TCAAGATCGT TGATCAGTGT CTGGACCGGC TGCGTCCCGG CCCGGTCATG
GTCGCCGACA AGAAGATCGC CTGGCCGTCC CAGCTTTCTG TCGGGTCCGA CGGGACGGGC
AACTCACTCG CGTACATCCG GAAGATCATG GGGACCTCGA TGGAGGCCCT GATCCATCAC
TTCAAGCTGG TGACCGAGGG ATTCCGCGTC CCGGCCGGTC AGGTCTACAC CCAGATCGAG
TCGCCGCGCG GAGAGCTCGG CTACCACGTG GTCAGCGACG GCGGCACGAG ACCCTTCCGC
GTCCACGTGC GGGATCCAAG CTTCGTCAAC CTGCAGGCCG TCCCGGCGCT GACCGAGGGC
GGCCAGGTGG CGGACGTGAT CGTCGGGGTC GCCTCAGTCG ACCCGGTGCT CGGGGGAGTT
GATCGTTGA
 
Protein sequence
MTTNTSTSST TDDLTTGAPN GTGAPDGANG VGGPTGTVGG PGEHPAYEAG FTESANGRVY 
TVTGSDWEQI LGVGEEENER IVVNMGPQHP STHGVLRLVL EIEGETVTET RLVIGYLHTG
IEKSCEYRTW TQAVTFLTRA DYLSPLFNEA AYCLSVERLL GITEQVPERA TVIRVMVMEL
QRIASHLVWL ATGGMELGAT TAMIFGFRER EKVLDLLELI TGLRMNHAYI RPGGLAQDLP
DGAERAIRAF LADMPKRIRE YHALLTGQPV WKARMVDVNV LDAAGCIALG TTGPVLRAAG
LPWDLRKTMP YCGYETYEFD VPTALEGDSF ARYLVRLEEM GESLKIVDQC LDRLRPGPVM
VADKKIAWPS QLSVGSDGTG NSLAYIRKIM GTSMEALIHH FKLVTEGFRV PAGQVYTQIE
SPRGELGYHV VSDGGTRPFR VHVRDPSFVN LQAVPALTEG GQVADVIVGV ASVDPVLGGV
DR