Gene MCA2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2202 
Symbol 
ID3105073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2378928 
End bp2380109 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content64% 
IMG OID637171348 
Productaspartate aminotransferase 
Protein accessionYP_114622 
Protein GI53803503 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATAC GACTTTCCGA CCGCGTCCAG TCCATCAAGC CGTCCCCGAC TCTCGCCGTC 
ACCGCCCGCG CCGCCGCGAT GCGCGCCGCC GGCAAGGACA TCGTGGGACT GGGCGCGGGC
GAACCGGACT TCGACACGCC GGACCATATC AAACAGGCGG CCATCCAGGC CATCGAAAAG
GGTTTCACCA AATACACGGC GGTCGATGGA ACGCCCGGGC TCAAGCAGGC GATCCAGGCG
AAATTCAAAC GCGAAAACGG GTTGGATTAC GCGCTCGATC AGATCCTGGT GTCCTGCGGC
GGCAAGCAGA GTTTCTACAA TCTGGCCCAG GCCCTGCTCA ACCCCGGCGA CGAGGTCGTC
ATCCCGGCGC CTTACTGGGT GTCCTATCCG GACATGGTGC TGCTGGCCGG CGCCGTCCCG
GTGATCGTCG AGGCCGGGCA ACAGCAGGCG TTCAAGATCA CGCCGGCACA ACTGGAAGCC
GCGCTGACGG CCAGAACCCG GCTGTTCGTG ATCAACAGTC CATCCAATCC CACCGGCATG
GCCTACACCG CGGAAGAGCT GGCCGGCCTC GGTGAGGTGC TGCGGCGGTT TCCCGAGGTC
GTCATCGCCA CCGACGACAT GTACGAGCAC ATCCTCTGGG AAGGTGGATT CAGCAACGTC
CTGAACGTCT GCCCGGACCT GTACGAGCGG ACCGTGGTGC TGAACGGCGT GTCCAAAGCC
TACTCGATGA CCGGCTGGCG CATCGGCTAC GCAGCCGGGC CCGAGCGGCT GATCGAGGCC
ATGACCAACA TCCAGTCGCA GAGCACCTCC AATCCCACTT CGATCTCGCA GGTCGCGGCA
GAGGCCGCGC TCAATGGCGA GCAGGGCTTC ATCGCCGGCA TGGTGGAGGC TTTCAAGCAA
AGGCACGACT TCGTGGTCGG AAGACTGAAC GCCATTCCCG GCGTCGACTG CCTGAAAACC
CACGGCACCT TCTATGTCCT GCCGAATGTC GAAGCGGCGA TGGCCAGGCT GCATCTGGCG
GACGACGTGG CGCTGTCCGA ATACCTGATC GAACAGGGCG GCGTGGCCGT GGTGCCGGGC
TCGGCTTTCG GCGCACCGGG CCACGTCCGT CTCTCCATCG CCACCAGCAT GGCCAATCTG
GAAAAGGCCA TGGAACGCCT GGCGACCACC CTGTCCAAAT GA
 
Protein sequence
MSIRLSDRVQ SIKPSPTLAV TARAAAMRAA GKDIVGLGAG EPDFDTPDHI KQAAIQAIEK 
GFTKYTAVDG TPGLKQAIQA KFKRENGLDY ALDQILVSCG GKQSFYNLAQ ALLNPGDEVV
IPAPYWVSYP DMVLLAGAVP VIVEAGQQQA FKITPAQLEA ALTARTRLFV INSPSNPTGM
AYTAEELAGL GEVLRRFPEV VIATDDMYEH ILWEGGFSNV LNVCPDLYER TVVLNGVSKA
YSMTGWRIGY AAGPERLIEA MTNIQSQSTS NPTSISQVAA EAALNGEQGF IAGMVEAFKQ
RHDFVVGRLN AIPGVDCLKT HGTFYVLPNV EAAMARLHLA DDVALSEYLI EQGGVAVVPG
SAFGAPGHVR LSIATSMANL EKAMERLATT LSK