Gene P9211_07871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_07871 
SymbolaspC 
ID5731885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp692459 
End bp693640 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content39% 
IMG OID641285151 
Productaminotransferase class-I 
Protein accessionYP_001550672 
Protein GI159903328 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.404062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00724094 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAAAT CACATCTGAT CTCCGAAAGA ACACAAGAAT TACAGCTCTC ATTGACATTA 
GAGATCAGTG CTCGAGCAAA GTTACTAAAA AAGGAAGGGA AAGATATATG CAGCCTTAGC
GCTGGCGAAC CAGACTTTGA TACACCAAAT TACATTGTAA ATGCAGCAAT AGAAGCTCTA
AGAAATGGCA TTACAAGATA TGGTCCAGCT GCTGGCGATC CAGAACTCAG AGAGGCTATT
GCACAAAAAC TCACAACCTC GAATAACGTC CCATCAAAAG CAGAAAACAT TCTTATAACT
AATGGTGGGA AACAAGCCAT TTTCAACTTA TTTCAGATAA TCCTTAACCC AGGGGACGAA
GTGTTGATCC CCTCTCCTTA CTGGCTGAGT TACCCAGAGA TAGCAAAGCT AGCAGGTGCG
ATACCAGTAC CCTTACACAC ATCACCCAAA GATGGCTTTA AATTAAGTTC TGAAAAACTA
GAAGAGAAGA TTACAAACAG AACAAAACTC TTAATACTTA ATTCTCCTTG CAATCCGACA
GGTCGAGTAA TTCAAAAAGA AGAGCTTATC TCTATTGCTG AGGTATTACG CAGGAATAAA
CAACTCCTAG TGATGACTGA TGAAATTTAT GAATATCTAA TATCTGAAAA TGAATCGCAT
CATAGTCTTG CAGCGATTGC TCCAGACTTA AGAGAAAGAA TATTTATTGT TAACGGATTC
GCCAAAGCAT GGGCGATGAC AGGTTGGCGA ATAGGGTACT TAGCTGGCCC AAAAGAATTT
ATTAAAACTG CCATCGCATT ACAAAGCCAA AGTACGAGTA ATGTATGTAG CTTTGCTCAA
CGTGGTGCCC TTGCTGCACT TCTAGGGCCA AAGGAATCTA TAAAAACAAT GAGTAGAAGC
TATAACGAAC GAAGAGAAAT ACTTACTAAA GGGCTTAATA GTATTAATGG AATTTCGTTA
ATTCCACAGA AAGGTGCGTT TTATGCATTC CCAGAATTAG CCCCGTCACT TCCAAACTCA
CTAAGTTTCT GCAAATTAGC CTTGGAGAAA GTAGGGCTAG CAATAATTCC TGGCATTGCC
TTTGGAGAAG ATCGGTGCGT AAGATTATCT TGTGCAGTTT CAGAAGATAC AATCAAAGAA
GGTATTGCAC GTCTCGAAAA ACTGATAACA CAACTAATTT GA
 
Protein sequence
MSKSHLISER TQELQLSLTL EISARAKLLK KEGKDICSLS AGEPDFDTPN YIVNAAIEAL 
RNGITRYGPA AGDPELREAI AQKLTTSNNV PSKAENILIT NGGKQAIFNL FQIILNPGDE
VLIPSPYWLS YPEIAKLAGA IPVPLHTSPK DGFKLSSEKL EEKITNRTKL LILNSPCNPT
GRVIQKEELI SIAEVLRRNK QLLVMTDEIY EYLISENESH HSLAAIAPDL RERIFIVNGF
AKAWAMTGWR IGYLAGPKEF IKTAIALQSQ STSNVCSFAQ RGALAALLGP KESIKTMSRS
YNERREILTK GLNSINGISL IPQKGAFYAF PELAPSLPNS LSFCKLALEK VGLAIIPGIA
FGEDRCVRLS CAVSEDTIKE GIARLEKLIT QLI