Gene Synpcc7942_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0101 
Symbol 
ID3773441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp101259 
End bp102593 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content57% 
IMG OID637798507 
Producttype 2 NADH dehydrogenase 
Protein accessionYP_399120 
Protein GI81298912 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCGA CTGTCAGCAA ACCGCGTGTC GTCGTCATTG GCGGCGGTTT TGGAGGGCTG 
TATACCGCCC TCAATCTTGG CAAGACCTCT GTCGAGCTCA CCCTGATTGA TAAACGAAAC
TTTCACCTGT TTCAACCCTT GCTCTACCAA GTGGCTACAG GGGAAATTTC GCCGGGCGAT
ATCGCTGCGC CGCTCCGGGC GATCGTAGGT CGTAACCCCA ATACCCGCGT CATTCTCGGT
GAAGTGACTG ACATCGATCC GCAGGCCCAT TGGGTGCGCG TTGGCGATGA AATTGTCGAA
TACGACTACT TGGTTGTGGC GACGGGTGCC AGCCACCACT ACTTTGGCAA CGACCAATGG
CAGCCCTTTG CTCCGGGGCT GAAAACGGTT GAAGATGCGC TGGAAATGCG CCGCCGGATT
TACTTTGCCC TCGAGCAAGC TGAGCAGGAG AGCGATCCAG AGCGTCAGCA AGCTTGGTTG
ACCTTCACGA TCGTGGGAGC AGGCCCCACC GGCGTTGAAC TAGCTGGCGC GATCGCGGAA
TTAACCCGCG GTGAAATGCG CAAAGAATTC CGCAATGTCG ACACCACCAA AGCCAAGGTC
ATTTTGATTG AAGGCATGGA TCGGGTCTTA CCACCCTTCC CGCCAGAGCT GTCAGCCCAA
GCGCAAGTAC AGCTAGAAGG CTTGGGCGTG ACTGTGCAAA CCAAAGCCAT GGTCACCGAC
ATTCAAGAAG ATCGCGTCGT CTTTAAGACT GGCGACGACT TGCATGAGAT CCCTAGCCGC
ACGACCCTTT GGGCTGCGGG CGTCAAAGCG TCACCCTTGG GCAAGCTCCT AGCGCAACGA
ACCGGTGCAG AACTCGATCG CATTGGTCGC GTCATCGTCC AGCCTGATTT GCAGCTGCCG
ACTGACCCCA ACGTCTACGT CTTGGGTGAC CTGGCCCACT GCCCTGATCA AGCAGGCAAC
CCACTGCCCG GTGTGGCAGC AGTCGCGATG CAGCAGGGGG CTTATCTCGG TAAGGCACTC
AAGCGGCGGC TGAAGAGTCA ACCCGTTGAT CCCTTCCGCT ACCAAGACTT CGGCAGCATG
GCAGTGATTG GCCGTAACGC TGCGGTTGCC CGCTTAGCAG GTATTCGCCT CAGTGGTTTC
CCCGCATGGC TGGTCTGGGC TTTTATCCAC GTCTGGTATT TGATTGAATT CGACAGCAAA
TTGCTGGTGA TGGTGCAGTG GGCTTGGACC TACTTCAACC AGAAACGCGG CACTCGCCTA
ATCGTCAATC ATCACCGCAT GTCGGCCCCG GCAGCGATGA CTAATCCGGC CGAAAAAGAG
TTGGCGAAGT CCTAG
 
Protein sequence
MISTVSKPRV VVIGGGFGGL YTALNLGKTS VELTLIDKRN FHLFQPLLYQ VATGEISPGD 
IAAPLRAIVG RNPNTRVILG EVTDIDPQAH WVRVGDEIVE YDYLVVATGA SHHYFGNDQW
QPFAPGLKTV EDALEMRRRI YFALEQAEQE SDPERQQAWL TFTIVGAGPT GVELAGAIAE
LTRGEMRKEF RNVDTTKAKV ILIEGMDRVL PPFPPELSAQ AQVQLEGLGV TVQTKAMVTD
IQEDRVVFKT GDDLHEIPSR TTLWAAGVKA SPLGKLLAQR TGAELDRIGR VIVQPDLQLP
TDPNVYVLGD LAHCPDQAGN PLPGVAAVAM QQGAYLGKAL KRRLKSQPVD PFRYQDFGSM
AVIGRNAAVA RLAGIRLSGF PAWLVWAFIH VWYLIEFDSK LLVMVQWAWT YFNQKRGTRL
IVNHHRMSAP AAMTNPAEKE LAKS