Gene Rcas_1321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1321 
Symbol 
ID5538793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1701264 
End bp1702517 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content61% 
IMG OID640893458 
ProductNADH dehydrogenase I, D subunit 
Protein accessionYP_001431435 
Protein GI156741306 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.735553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTG CAGAAACCCG TGCGCGCAAT CTGAGCATCC CCGCTCCCAG CCAGATTACC 
CGTCCGGCGC TCGAAGGGGT GAAAGAGACC ATGGTTCTGA ACATGGGTCC GCATCACCCC
AGCACCCACG GCGTGCTGCG GCTGGTCGTC GAACTCGATG GCGAGACAGT CGTTGATGTT
GCGCCCGACA TTGGCTACCT CCATACCGGC ATCGAAAAAA CGATGGAGAG TAAAACGTAT
CAGAAAGCGG TCGTGTTGAC CGACCGCACG GATTATCTGG CGCCGCTCTC CAACAATCTG
AGTTATGTGC TGGCGGTCGA GAAACTGCTC GGATGCGAGG TTCCAGAGCG CGCCACCGTT
GCGCGGGTGC TGCTGGTCGA ACTGCAACGC ATCGCCAGCC ATCTGGTGTG GCTCGGCACG
CACGCGCTCG ACCTGGCAGC CATGAGTGTG TTCCTCTATG GCTTCCGCGA ACGCGAACAA
ATTCTCGATA TTTTTGAACT GGTCTCGGGC GCGCGCATGA TGACCAGTTA TTTCCGCGTC
GGCGGGCTGG CGTATGACCT GCCAATCGAG TTCGATGCCG CTGTCGAGGC ATTCCTTGCG
ATCATGCCGG GGCGCATCGA TGAGTACGAA GCGCTGCTGA CCGATAATCC GCTCTGGATC
GAGCGCACGC AGGGCATTGG CGCCATCGAT AGCGAGGCCG CCATTGCGCT GGGGCTGACC
GGACCTGGAC TGCGCGCGAC TGGCGTGGCG TGGGACTTGC GCAAAACCAT GCCCTACTGC
GGCTATGAGA CCTACTCCTT CGCCGTTCCG ACCGCCACCC ATGGCGATAT TTATGACCGC
TATCTGGTGC GCATGGCGGA AATGCGCGAA AGTGTCTCGA TCTGTCGCCA GGCGTTGCAA
CGGTTGCGCG ACATCGGTCC TGGACCGTAT ATGACGCTGG ATCGCAAGAT TGCGCCGCCG
CCGAAGAGCG AAATTACGCA GAGCATGGAG GCGCTCATTC ACCATTTTAA GTTGTGGACC
GAGGGCTTTA AGCCGCCGCG CGGCGATGCG CTGGCGGCGG TGGAGTCGCC TCGCGGCGAA
TTGGCAACCT ACATCGTGAG CGATGGCAGC GCCAAACCGT ATCGGGTCCA TTTCCGCGCG
CCTTCGTTTG TCAATCTGCA ATCGCTGCCG CACATGGCAC GTGGGCATCT GGTCGCCGAT
CTGGTGGCGC TGATTGCCTC GCTCGACCCG GTGCTCGGAG AAGTTGATCG CTAA
 
Protein sequence
MTVAETRARN LSIPAPSQIT RPALEGVKET MVLNMGPHHP STHGVLRLVV ELDGETVVDV 
APDIGYLHTG IEKTMESKTY QKAVVLTDRT DYLAPLSNNL SYVLAVEKLL GCEVPERATV
ARVLLVELQR IASHLVWLGT HALDLAAMSV FLYGFREREQ ILDIFELVSG ARMMTSYFRV
GGLAYDLPIE FDAAVEAFLA IMPGRIDEYE ALLTDNPLWI ERTQGIGAID SEAAIALGLT
GPGLRATGVA WDLRKTMPYC GYETYSFAVP TATHGDIYDR YLVRMAEMRE SVSICRQALQ
RLRDIGPGPY MTLDRKIAPP PKSEITQSME ALIHHFKLWT EGFKPPRGDA LAAVESPRGE
LATYIVSDGS AKPYRVHFRA PSFVNLQSLP HMARGHLVAD LVALIASLDP VLGEVDR