Gene NATL1_04601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04601 
SymbolmetB 
ID4780470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp422288 
End bp423457 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content38% 
IMG OID640083737 
Productputative cystathionine gamma-synthase 
Protein accessionYP_001014289 
Protein GI124025173 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCAG TAAAAAGAGA CAAGAAGTTC AGCAATGGAG TAAACACAAG AGTCATTCAT 
CACAAAGATA ATTTTTCTGA AGGAACTGGT TCAATAATGC CGCCAATCTT TCCAACCTCA
ACGTTCGTTC ATGGCAATGA GGGTGGCTTT GATTACACTC GTTCAGGAAA TCCAAATTTT
CGAATTCTTG AATCAGTTTT GTCTGATCTG GAAGAGTGTC AATTCGCTAG TGTATTTAGT
TCTGGAGTCG CTGCGATTAC AGCAATAGTC TCCACTCTTC AGGCTGGAGA CTTAATCCTT
TGTGAGGAGA ACCTTTATGG ATGTACGGTG AGATTATTTG AACAAGTTTT TAATCGTTTT
GGATTAAAAA CTCAATGGAT AGACTTTACT AAGCCCAATT TCCAAGAAGT CATTTCAAAT
CACAAACCCG CGATGATTTG GATCGAAAGT CCTACTAACC CACTCCTCAA AATTATTGAT
ATTGAAGGGA TTTGTCATTT CTCAAATAAA ATGAAAATAC CTGTTGTTGT AGACAATACT
TTTGCAACAC CTCTATTACA AAGACCTCTT AAACTTGGAG CGACCTTATC TTTAACTAGC
ACGACCAAGT TTATTAATGG TCACTCAGAT GCACTTGGAG GTGCAGTATG CACCGAGAAT
CCTATCTGGA GAGACAAGCT AAATTTCGCC CAGAAAGCTC TTGGATTAAA CCCTTCTCCC
TTTGATTGCT GGCTTATCAC ACGAGGAATA AAAACTCTTC CACTTCGCCT AGAAAGACAA
GTTAATAATG CATCCAAAAT AGCTAATCAA TTAGCCGATA ATCCAGCAAT AAAATATGTT
CGATATCCTT TCAGGAATGA TCACCCACAA TGTAAATTAG CAAAAAAACA AATGGCTATG
GGAGGAGCAA TTGTTACTGC CACTGTTAAC GCAACCCAAG CTCAAACCTA TTCATTTTGT
AAAAGTCTTC ATTACTTCAA AATGGCAGAA AGTCTGGGAG GAATTGAAAG TCTTGTTTGC
CATCCAGCTA CAATGACACA TGCTTCAGTG TCCAAGGAAA CAAAATTAAA AATTGGAATT
ACTGATTCAC TTATTCGGTT TTCTATTGGA TGTGAGGACA TTGAAGACTT AAGTGCTGAT
TTGAATCAAG CCTTAGGAAC TATCTCTTGA
 
Protein sequence
MGSVKRDKKF SNGVNTRVIH HKDNFSEGTG SIMPPIFPTS TFVHGNEGGF DYTRSGNPNF 
RILESVLSDL EECQFASVFS SGVAAITAIV STLQAGDLIL CEENLYGCTV RLFEQVFNRF
GLKTQWIDFT KPNFQEVISN HKPAMIWIES PTNPLLKIID IEGICHFSNK MKIPVVVDNT
FATPLLQRPL KLGATLSLTS TTKFINGHSD ALGGAVCTEN PIWRDKLNFA QKALGLNPSP
FDCWLITRGI KTLPLRLERQ VNNASKIANQ LADNPAIKYV RYPFRNDHPQ CKLAKKQMAM
GGAIVTATVN ATQAQTYSFC KSLHYFKMAE SLGGIESLVC HPATMTHASV SKETKLKIGI
TDSLIRFSIG CEDIEDLSAD LNQALGTIS