Gene Noc_1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1681 
Symbol 
ID3705627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1875794 
End bp1877641 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content46% 
IMG OID637738159 
ProductAlpha amylase 
Protein accessionYP_343683 
Protein GI77165158 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCACT GCCATGGTAT GCCTTTTGGC GCTGAAGTGA CAGAAAAAGG TACCGTGCGT 
TTTCGGTTAT GGGCGCCTGC CGCTAAGCAA GTGGAGTTAT GCTTGGAGGA TACGCTAGAA
GTAGTCCCCT TAGCTATGAC TCCTAAAGAA GAGGGTTGGT TTGAGCTAGA AACTAAAGAG
GCTGGTCCGG GCAGTTTATA CTGCTATCAG ATCAATGGTG GGATGCGGGT TCCTGATCCC
GCTTCTCGAT TTCAGCCCCA AGATATCCAC GGTCCCAGCG AGGTGGTTGA TCCAGCTACT
TTTAAATGGC AGGAAGAGGG ATGGAATGGA CGACCTTGGG AAGAGGCCGT TATTTATGAA
ATCCATGTAG GAACTTTTAC CCCAGAGGGT ACTTTTCGAG GATTGGAGAG TCACCTGGAG
CATCTAGCAA AATTAGGGGT AACCGCATTG GAACTGATGC CGGTTGCCGA TTTCCCTGGC
CGTTGGAATT GGGGTTACGA TGGTGTTTCC TTGTTTGCAC CAGATAGCCG TTATGGTCGG
CCCCACGATC TCAAATCCCT TGTGCAAGCT GCCCATGCTT GTGGATTAAT GATATTTTTA
GACGTGGTGT ATAATCATTT CGGTCCGGAA GGCAACTATC TCCATCAATA TGCCCCAGAC
TTTTTTACAG AACGCCACCA AACTCCATGG GGGGCAGCTA TCAATTTTGA TGGGAAGAAT
GCCCATTGGG TCCGGCAGTT TTTTATCCAT AATGCTCTCT TCTGGCTGGA GGAATACCAA
TTTGATGGCC TTCGGTTGGA TGCGGTTCAT GCGATTCAAG ATGATTCTAA GTTCCATATT
CTTGAAGAGT TGGCAGAGAC GATTTTCTGC CATCTAGATT CCAGGCGGCG TATACATCTG
GTGCTAGAAA ATGATAATAA TATAGCCCGT TACCTTACCC GGAAGCCTAA CGGACAACCC
CGTTGGTACA CTGCGCAATG GAATGATGAT ATCCACCATG CCTTGCATGT ACTCACCACT
CAGGAAACTA CAGGTTATTA CTTAGACTAT GCCGATCAGC CTATTGCTCA TCTAGGCCGT
TGTTTGAGCG AAGGTTTTGG TTATCAAGGG CAACATTCTC CTTACAGGGA AGGCAAACCC
CGTGGCGAGC CTAGCAAGAT TTTACCGCCA AGCGCTTTTG TAACCTTCTT TCAAAACCAC
GATCAAGTGG GTAACAGGGC TTTCGGTGAG CGAATAACGG CTTTAATAAC ACCTGAAGAA
GTGAAGGCAT TAACCGCGTT GCTATTGCTC TCTCCTTTCC CGCCCCTTTT ATTTATGGGC
CAGGAGTGGG GATCAACTCA ACCCTTTCCT TTTTTTTGCG ATTTTAGTGA GGATTTGGCT
GCAAGTGTTC GGGAGGGTCG GCGAAGGGAA TTCGCCCATT TTCCTGAATT CAATAATCCA
GCGGCCCAAG AACGAATTCC GGATCCTACC GCTCAGGCGA CCTTTGACAA TGCTGTTTTA
AACTGGACTC ACGCAACCAA TGGAAAAGGG AAAGAATGGT TTGAGTTACA TCAAAATCTG
CTAAAACTAC GGCGCCAATG GATCATTCCC AGGCTAGCTG CTATGAGAAA AAACAACGGT
TGTTATATAC CCTTAGGTAA GCAGGCACTA CAAGTTCGGT GGCAATTAGG CGATGGGGCA
CAATTAACAA TATTAGCTAA CTTAGGAAAA ATTTCTATTT TCTTATCGAC TCTCCCTTCC
GGAGAAGTAC TTTTTACCAC CTTTTCAGAT TTAAATAGGA TACTTATCCA CAAAAATCTG
CCCCCTAAGA CGGTAATTTG GTTTCTCAAA GAGAATTCCA GTGATTGA
 
Protein sequence
MQHCHGMPFG AEVTEKGTVR FRLWAPAAKQ VELCLEDTLE VVPLAMTPKE EGWFELETKE 
AGPGSLYCYQ INGGMRVPDP ASRFQPQDIH GPSEVVDPAT FKWQEEGWNG RPWEEAVIYE
IHVGTFTPEG TFRGLESHLE HLAKLGVTAL ELMPVADFPG RWNWGYDGVS LFAPDSRYGR
PHDLKSLVQA AHACGLMIFL DVVYNHFGPE GNYLHQYAPD FFTERHQTPW GAAINFDGKN
AHWVRQFFIH NALFWLEEYQ FDGLRLDAVH AIQDDSKFHI LEELAETIFC HLDSRRRIHL
VLENDNNIAR YLTRKPNGQP RWYTAQWNDD IHHALHVLTT QETTGYYLDY ADQPIAHLGR
CLSEGFGYQG QHSPYREGKP RGEPSKILPP SAFVTFFQNH DQVGNRAFGE RITALITPEE
VKALTALLLL SPFPPLLFMG QEWGSTQPFP FFCDFSEDLA ASVREGRRRE FAHFPEFNNP
AAQERIPDPT AQATFDNAVL NWTHATNGKG KEWFELHQNL LKLRRQWIIP RLAAMRKNNG
CYIPLGKQAL QVRWQLGDGA QLTILANLGK ISIFLSTLPS GEVLFTTFSD LNRILIHKNL
PPKTVIWFLK ENSSD