Gene Noc_1228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1228 
Symbol 
ID3706418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1344780 
End bp1346030 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content43% 
IMG OID637737730 
ProductABC transporter, ATPase subunit 
Protein accessionYP_343259 
Protein GI77164734 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAACTT CCATGGGAGA AACCGATCCG GAAATTTTGA TTAATGTGGA TAATGTTAGC 
AAGTGCTATC ATTTATACCA CCGCCCTCAG GATCGACTGT GGCAGTCTTT TTTTCGAGGG
CGTAAAAAAT TTCACCGTGA ATTTTGGGCA TTACTCAATA TTTCCTTGCA GGTCCGGCGG
CATGAGGCTG TAGCTATCAT CGGGCGCAAT GGCGCGGGAA AAAGCACCTT GCTACAGATC
ATTGCCGGCA CTTTAGCGCC TACCTCAGGA AAAATCGCTG TTCAAGGCAG AGTTGCGGCC
TTGCTCCAAT TGGGAAGCGG TTTTAACCCT GAATTCACGG GGCGGGAAAA TGTGTTTCTA
AACGGCGCTA TCCTTGGGTT TAGCCGCGCT GAAATTAGCT GCCGCTTTGA AGCAATAACA
GCCTTTGCGG AGATCGGTAA TTTTATTGAT CAACCGGTTA AAACCTATTC TTCCGGGATG
ATCATGCGGC TTGCTTTCGC TGTTTCTACT TGTCTTGAAC CGGAAGTGCT GATCATCGAT
GAGGCTTTAG CGGTAGGGGA CGCAGCATTC CAATTTAAAT GCAGGGATCG GCTTCAGGGT
CTGATTACTC AAGGAACTAC ATTATTACTG GTTTCCCATG ATATGAGCGC TGTTAAATCG
TTTTGTCATC GTGCAGTCTA TATTGAAGAT GGCCAAAAAA AGGCGGAGGG TGAACCGGAA
CACATCTCTG AGCGTTATTT TATGGATGTG CGTGCCCGGC AGATGAGTCG CGTACAACAA
GAAAAAAAAG GCAGGGATAC CATCGTCTGT GAAAAGTCCC ATGGTTATGG CACTAGTGAA
GGAGAAATTG AGTCCGCTGT TTTTGTAGCA ACAAGAGGAA GTGCCGCTCT TTTTAATTAT
GGTGATACTA TTGAATTAAA GGTTACCTGC CAGTTTGCGT CTCACATAGA GTTTCCATGC
CTATCGGTAG TACTCCAGGC AACTAATCTA GTGGGTATAG GTGGGCGCTG GTTCCGTATT
AACCCGCTTT CTTCGGAATT CGATGCTGCT GCTATTACTT TAAAAATAAA GTTTCCCGCA
AAATTTAATG ATGGAAAATA TTTTATTACC CTAAGATTGG AAGATCGTAA AGATCAGAAA
CAATATTTTA TTTTACATAA AATTCCAGGC GCATTAACAT TTGATATTTT GCCGCGTACT
AATAACGATT TATTAGGGTT TAATGATATA AAACTTACCT GTGAGCAGTA A
 
Protein sequence
MLTSMGETDP EILINVDNVS KCYHLYHRPQ DRLWQSFFRG RKKFHREFWA LLNISLQVRR 
HEAVAIIGRN GAGKSTLLQI IAGTLAPTSG KIAVQGRVAA LLQLGSGFNP EFTGRENVFL
NGAILGFSRA EISCRFEAIT AFAEIGNFID QPVKTYSSGM IMRLAFAVST CLEPEVLIID
EALAVGDAAF QFKCRDRLQG LITQGTTLLL VSHDMSAVKS FCHRAVYIED GQKKAEGEPE
HISERYFMDV RARQMSRVQQ EKKGRDTIVC EKSHGYGTSE GEIESAVFVA TRGSAALFNY
GDTIELKVTC QFASHIEFPC LSVVLQATNL VGIGGRWFRI NPLSSEFDAA AITLKIKFPA
KFNDGKYFIT LRLEDRKDQK QYFILHKIPG ALTFDILPRT NNDLLGFNDI KLTCEQ