Gene Noc_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1023 
Symbol 
ID3707284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1133503 
End bp1134801 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content56% 
IMG OID637737528 
Productfolylpolyglutamate synthetase 
Protein accessionYP_343061 
Protein GI77164536 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCGTT TTTCCCGGCT TTCGGATTGG TTGAGATGGC AGGAAACCGC CCATTGGCCT 
CGGGTTGATC CCAGTTTAAT TAGAGCCAGC GCTGTGCTTC AGCGTATGGT GCTCTCCACT
CCTCCTTTTC CTATAGTGAC CATTGCCGGT ACTAATGGAA AAGGTTCTAC GGTGGCCCTA
TTGGAGGCCA TCCTGCTGGC CGCTGGTTAC CGGGTGGGCA GTTATACCTC TCCCCACCTA
TTGCGCTATA ACGAGCGGAT TAAGATACAA GGGGAAGCTG TCTCGGATGA TATCATTTGT
CAATCCTTTG CCCGTATCGA TGCTGCCCGC CACGAAATTT CCTTAACCTA CTTTGAATTT
GGCACCTTGG CCGCCATTGA TATTTTCCAT CAAGCGGGAT TGGACATAGC GTTGCTAGAG
GTGGGGCTAG GTGGGCGCTT GGATGCGGTC AATGCACTGG ATGCGGATGT GGCTGTGATT
ACCACGGTAG ATATTGATCA TATCAATTTT TTAGGGCCTG ATCGGGAGTC TATCGGGTTT
GAAAAAGCAG GTATTTTTCG CTCCCATCGT CCCGCCATCT GCGGCGATCC GGAGCCTCCA
GAGAGCGTAC TTGCCCACGC TAAGGAATCG TCGGTCCCCC TTTATCGGAT AGGCCGCGAT
TTTCATTATC AGATGGCCGG CGAAGGATGG TCCTGGTGGA GTAATGGGAG CCATTACGCG
GATTTACCCC ATCCCGCACT TCAGGGAGCT TGTCAGTATC AGAATGGGGC AGCTGCGCTG
ATGGCATTAA AGCTTATGGC AGCACGGCTA CCTGTTTCGG AGACAGCCAT CCGGGACGGC
CTCAGCGCGG TGCGCCTACC GGGCCGTTTC CAGTGCCTTC CCGGTGAGAT GGAGCGGATT
TTCGATGTGG CCCATAATCT TCAGGGCGCC CGCTGGTTAG CCCGCTCGCT TACGGACAGG
CCCTGCGGGG GGCGAACTTA CGCGGTGTTA GGGATGTTGG CGGATAAGGA TGTGGCAGGG
GTGGTCAGCG TCCTGGAGAA CGCTATCCAT GGCTGGTTTG TTGGTGGATT GGCGGTCGAT
AGAGGGCTAT CGGGCCAAGC TTTGGCTGAG CGGATGGGCC ATCTCACCCC CGCTATTTAT
CAAAGCGTGG CGGAAGCTTA CCGGGCCGCC CTGGAAGCCG CTCAACCGGG GGATCGGGTG
CTGGCCTTTG GTTCTTTTCA TACGGTGGAA GCTGTGATGC GGCTAGAGGG ATTGACCTGC
TCTTCCGGCG CTGAGCGTTG TAGCCTTGCT TCGGTTTAA
 
Protein sequence
MARFSRLSDW LRWQETAHWP RVDPSLIRAS AVLQRMVLST PPFPIVTIAG TNGKGSTVAL 
LEAILLAAGY RVGSYTSPHL LRYNERIKIQ GEAVSDDIIC QSFARIDAAR HEISLTYFEF
GTLAAIDIFH QAGLDIALLE VGLGGRLDAV NALDADVAVI TTVDIDHINF LGPDRESIGF
EKAGIFRSHR PAICGDPEPP ESVLAHAKES SVPLYRIGRD FHYQMAGEGW SWWSNGSHYA
DLPHPALQGA CQYQNGAAAL MALKLMAARL PVSETAIRDG LSAVRLPGRF QCLPGEMERI
FDVAHNLQGA RWLARSLTDR PCGGRTYAVL GMLADKDVAG VVSVLENAIH GWFVGGLAVD
RGLSGQALAE RMGHLTPAIY QSVAEAYRAA LEAAQPGDRV LAFGSFHTVE AVMRLEGLTC
SSGAERCSLA SV