Gene Noc_0101 details


Gene Information

Locus tag: Noc_0101
Symbol:
ID: 3705861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 102414
End bp: 104069
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 56%
IMG OID: 637736617
Product: ferredoxin-dependent glutamate synthase
Protein accession: YP_342164
Protein GI: 77163639
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0069] Glutamate synthase domain 2
TIGRFAM ID:
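
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length = gene_length_bp(102414, 104069)
print(length)                     # 1656
print(protein_length_aa(length))  # 551
```

Both values agree with the Gene Length and Protein Length fields of the record.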


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCTTCCC TAAAACTCTC TGTTAAGCGC TTTAATTCCG ATGTGCTGAA ACGAGCCATC 
GTGCCAATTA TGGCTGGGGG ACTCTCGCTA ATTTTCCTCT CCCTGGTAAC GGTGAGTCCT
TGGTGGCTGC TAGGACTTAC CGTGACCCTG CCGCTAGTGG CGCTCGGCGT CTATGATTGG
GTGCAGCAAT GTTGGACCCT GACACGCAAC TATCCAGCTG CTGCCCGCAT CCGCTGGCTC
TTCTACGACC TTCGGCCCTA TCTGCGTGCC TATATCGTTG AGGACGATTT AGAAGGCAAG
CCATTTAGCT TTGATGCCCG TAACCTGGTT TACGCCCGCG CCCGAGGTAA GACCGATACT
CACCCCTTTG GGACAGAGCG CGATCTGGCC TCTGATGAAT ATGAATGGCT GTGTCATTCC
ATGGCACCGG CTGAAAATCC AGAAAAAAAC CCACGTGTCT CCATTGGCAG CGCCCAATGC
GAGCAGCCCT ATAACGCCTC ACTCCTCAAT ATTTCGGCCA TGAGTTTCGG CTCCCTCTCG
GCCAGGGCGG TAGAAGCACT GAACAAGGGA GCACGGCTTG GCGGGTTTTA TCATGACACG
GGCGAAGGAG GTATTAGCCC CTATCATCTA AAGTGGGGTG GGGATCTGGT TTGGGAAATC
GGTTCAGGCT ATTTCGGTTG CCGGGATGAT AAAGGAAATT TTGACCCAGG CTTGTTTCAT
GAAAGCGCTA CGCGCAATGA AGTCAAGATG ATTGAAATCA AACTCAGTCA AGGCGCGAAA
CCTGGCTATG GCGGGCTCTT GCCAGCTGCT AAGGTAACCG AGGAAATTGC GTGGGTACGT
AAGGTTCCCA AGGGGCAAGA CTGTCTCTCG CCTCGGAGCC ATTCGGCCTT TTCTACTCCC
TTGGAGATGT TGGAGTTCGC AGCTAGGATG CGGCAGCTTT CTGGCGGCAA ACCGGTAGGC
ATCAAGCTTT GCGTTGGACA AGTTCATGAA GTGCTGGCGA TTATGAAAGC AATGCTCAAG
ACCGGTATCC ATTTGGACTT TATCGTGGTC GACGGGGGCG AGGGGGGCAC AGGGGCGGCA
CCGGTGGAGC TATCGGACTA CGTGGGAATG CCCCTAACCG AAGGGCTGAT GGTAGTTCGC
AACGCCCTAG TTGGCACCGG ACTGCGGGAT AAGGTGCGGC TTGGAGCCAG CGGCAAAGTC
TATTCAGGCG CTGGCATGGC CCGCAATTTC GCCATCGGCG CGGATTGGTG TAACGCAGCA
CGGGCTTTTA TGTTTTCCAT TGGCTGCATT CAGGCCCAGC GCTGCCATTT GGGCACATGC
CCAACCGGCG TTACCACGCA AGATCCTGGC CGCCAGCGGG GCCTGGTAGT GGATGTGCAG
GCTAAAAGGG CAGCGCGGTT TCACCAACAA ACCCTTACTG CGCTTGGCCA TATCGTCGCC
GCGGCGGGTC TGGCTCATCC TCGCGATCTG CAGCCTTATC ATCTTATTCG CCGAGTTGGC
ACAGCCGAGG CTAAACTCTT TGACCGAATT TATCCCTTCC TGCCGGAAAA CGCCCTACTT
GAGGGGGCCG AAGATACTCC CTACAGCGAA TGGTGGCAGG CTGCTAATCC TGCCAGTTTT
CGGCCGCGTA TTGACTTGGC CGCTGCACGA ATCTAA
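
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any per-base statistic such as the GC content is recomputed. The following is a minimal sketch of that step; the helper names are hypothetical, and the example runs only on the first two blocks of the listing rather than on the full 1656 bp sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two blocks of the gene sequence, used here only as a small example.
first_blocks = clean_sequence("GTGTCTTCCC TAAAACTCTC")
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 45.0
```

Passing the complete listing through `clean_sequence` and then `gc_percent` would give a figure to compare against the GC content field.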
 
Protein sequence
MSSLKLSVKR FNSDVLKRAI VPIMAGGLSL IFLSLVTVSP WWLLGLTVTL PLVALGVYDW 
VQQCWTLTRN YPAAARIRWL FYDLRPYLRA YIVEDDLEGK PFSFDARNLV YARARGKTDT
HPFGTERDLA SDEYEWLCHS MAPAENPEKN PRVSIGSAQC EQPYNASLLN ISAMSFGSLS
ARAVEALNKG ARLGGFYHDT GEGGISPYHL KWGGDLVWEI GSGYFGCRDD KGNFDPGLFH
ESATRNEVKM IEIKLSQGAK PGYGGLLPAA KVTEEIAWVR KVPKGQDCLS PRSHSAFSTP
LEMLEFAARM RQLSGGKPVG IKLCVGQVHE VLAIMKAMLK TGIHLDFIVV DGGEGGTGAA
PVELSDYVGM PLTEGLMVVR NALVGTGLRD KVRLGASGKV YSGAGMARNF AIGADWCNAA
RAFMFSIGCI QAQRCHLGTC PTGVTTQDPG RQRGLVVDVQ AKRAARFHQQ TLTALGHIVA
AAGLAHPRDL QPYHLIRRVG TAEAKLFDRI YPFLPENALL EGAEDTPYSE WWQAANPASF
RPRIDLAAAR I