Gene ECD_02201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02201 
SymbolnuoN 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2277233 
End bp2278510 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content58% 
IMG OID 
ProductNADH dehydrogenase subunit N 
Protein accessionACT44023 
Protein GI253978353 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTTA CGCCGCTGAT GCGCGTTGAT GGTTTCGCCA TGCTTTACAC CGGGCTGGTA 
TTGTTGGCGA GCCTCGCCAC CTGTACTTTC GCCTACCCGT GGCTTGAAGG CTATAACGAC
AACAAGGATG AGTTCTACCT GTTGGTGTTA ATTGCCGCGC TGGGCGGGAT CCTGCTGGCG
AATGCCAACC ATCTGGCGTC TCTGTTCCTC GGTATCGAAC TGATCTCTTT GCCGCTGTTT
GGCCTGGTCG GTTACGCTTT CCGCCAGAAA CGTTCACTGG AAGCCAGTAT CAAATACACC
ATCCTTTCTG CCGCAGCGTC TTCTTTCCTG CTGTTTGGTA TGGCGCTGGT GTATGCGCAG
TCTGGCGACC TGTCGTTTGT CGCGTTGGGT AAAAACCTTG GCGACGGTAT GCTCAACGAG
CCGCTGTTGC TGGCAGGTTT CGGCCTGATG ATTGTTGGCC TCGGCTTCAA ACTCTCTCTG
GTGCCGTTCC ACCTGTGGAC GCCAGACGTA TACCAGGGCG CGCCTGCGCC GGTTTCCACT
TTCCTGGCGA CGGCGAGCAA AATCGCTATC TTCGGTGTGG TGATGCGTCT GTTCCTCTAC
GCACCGGTGG GTGACAGCGA AGCGATTCGC GTGGTGCTGG CGATTATCGC CTTTGCCTCC
ATCATCTTCG GTAACCTGAT GGCGCTGAGC CAGACCAATA TCAAACGTCT GCTCGGTTAC
TCATCTATCT CTCACCTCGG CTATCTGCTG GTAGCGCTGA TTGCGCTGCA AACCGGCGAG
ATGTCGATGG AAGCGGTAGG GGTTTACCTG GCCGGTTATC TGTTCAGCAG CCTCGGCGCG
TTCGGCGTGG TCAGCCTGAT GTCCAGCCCG TATCGTGGCC CGGATGCTGA TTCCCTGTTC
TCTTACCGCG GTCTGTTCTG GCATCGTCCG ATCCTCGCGG CAGTGATGAC GGTGATGATG
CTGTCTCTGG CCGGTATCCC GATGACGCTG GGCTTTATCG GTAAGTTCTA CGTGCTGGCG
GTCGGTGTCC AGGCACACTT GTGGTGGCTG GTGGGTGCCG TGGTTGTCGG TTCGGCAATC
GGCCTCTACT ACTACCTGCG CGTGGCGGTG AGCCTGTACC TGCACGCCCC GGAACAACCG
GGTCGCGATG CACCATCAAA CTGGCAGTAC AGCGCGGGCG GTATCGTGGT GCTGATTTCT
GCACTGTTGG TACTGGTGCT GGGTGTATGG CCACAACCGC TGATTAGTAT TGTGCGTTTG
GCAATGCCGC TGATGTAA
 
Protein sequence
MDVTPLMRVD GFAMLYTGLV LLASLATCTF AYPWLEGYND NKDEFYLLVL IAALGGILLA 
NANHLASLFL GIELISLPLF GLVGYAFRQK RSLEASIKYT ILSAAASSFL LFGMALVYAQ
SGDLSFVALG KNLGDGMLNE PLLLAGFGLM IVGLGFKLSL VPFHLWTPDV YQGAPAPVST
FLATASKIAI FGVVMRLFLY APVGDSEAIR VVLAIIAFAS IIFGNLMALS QTNIKRLLGY
SSISHLGYLL VALIALQTGE MSMEAVGVYL AGYLFSSLGA FGVVSLMSSP YRGPDADSLF
SYRGLFWHRP ILAAVMTVMM LSLAGIPMTL GFIGKFYVLA VGVQAHLWWL VGAVVVGSAI
GLYYYLRVAV SLYLHAPEQP GRDAPSNWQY SAGGIVVLIS ALLVLVLGVW PQPLISIVRL
AMPLM