Gene MCA2561 details

Gene Information

Locus tag: MCA2561
Symbol:
ID: 3102343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 2740479
End bp: 2743352
Gene Length: 2874 bp
Protein Length: 957 aa
Translation table: 11
GC content: 55%
IMG OID: 637171699
Product: glycosyl transferase, group 2 family protein
Protein accession: YP_114969
Protein GI: 53803315
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
        [COG3754] Lipopolysaccharide biosynthesis protein
TIGRFAM ID:
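
The coordinate and length fields in this record agree with each other, and a short check makes the arithmetic explicit. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2740479, 2743352)
print(length_bp)                  # 2874
print(protein_length(length_bp))  # 957
```

Both values match the Gene Length and Protein Length fields listed in the record.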


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATA CTCCTGTTGA ACCAGTTCGC CTGACCCCTG ACAATCCACT GGTTTCAGTG 
GTAATACGAA CGAAGGACCG CCCCAAACGT CTCGCAGAAG CCGTACGAAG CGTACGCGAA
CAAACTCACC GCCCATTGGA AGTGATCGTC GTCAACGACG GCGGATGGCC CTTGCCCGAG
GAGGCTCTCA AGGAGCAGGC CGGTGACGTA GAACTCATAC TGATTCAACT GGAACAGAAC
CAAGGCCGCT CCATCGCCGC CAATACGGGA CTGCAAGCCG CCACGGGACG CTATCTGTGT
TTTCTCGATG ACGACGACAG GTTCCTGCCG GACCATATCG AACTTCTATC CTCTTGCCTG
GAACACATAG AACACCGGGT GTGTTATTCC GACGCCGAGC TTTGCTGGCA GGTTTACAAT
GTCGAAACAG GTGAATTCGA TATCGTCAAT CGCCAGGTAT TTGGCTCGCG AGACTTCAAC
CTCGCCGAAC TGCTCTGTGG CAACTACATT CCGCTAAACA CCCTTCTTTT CGATCGACAG
GTCTTACTGC AAGTTGGGGG GTTCGATCCG CAGTTCGACA TTTACGAAGA TTGGGACCTC
CTGCTGCGTG TCGGCAGCCT CTACCCTTTC TATCACCTGC CTCGTGTGAC CGCACAGTAC
AACCAATGGA GCAAGGATCA TCAAAACTTT TACAACCATT TTTCCCAAAC CGGCCAGGCT
CCCCTCTTAC ACATTACCGG CTACGACAGG CTCATCGAAA AAAACCGCCA TCTTTTCAGC
GCTGATGCGG CTAGGCATCT GCTGTCCGTC ATCAACCGCC TGCGGGCCAT CGAGGCAGAA
CTTGCTCAAC GCGATGCCGT CCCGGCCGAA TCCCCTCTAA CCCGACTGAC GGCACAACTC
GAGCAAACGC TCGCGACCAT CGGTCCATCC TTTGCCACGC TCGAAGGACG TCTCGCCGAT
CTCGACGCCT GCCAACGTGA CGCCCTGGTT TCGGCCACGT CCAAACTAAA TGAGGGTGTC
AGCGACCTCG ACGGGCTGCG CTCCCAGGTC GCTCACCTTT GCATGGAATT CGAAAACGCC
TCGGGGAAGA TCGATACCAG CCAAGCCGAG CTGCAAAAAA CCGCGACCCT GCTCGTCCAG
ACTCAAAACC TGCTAGAGGA TTATCGAAAA GAAATTGCCC GAACAACCGA GCTCAATCGA
CTCATCTCCG AGCGCGACAG CGAACTCCGA ACGCTTAGGC AGACCATCGC ACGGAACGAG
GCGGAAATCA CCGCACTTCA TAAAGAAAGG AACGAGGCTA ACGCGACGAT CTCCGCCCTC
GATGAAAAGG TCGCTCACGC GGAGACAGAG AAGGCTTCCT TGCATCAAAT CATCGCCCAA
ACGGAGGCCC GGCTCGCTTC AGTCTACGGC TCGACCAGTT GGCGTATAGC CGCGCCACTA
CGCGCCGTCT CCGTGGCTGT GAGATGGCTC TTACGAAACA CCCGCCGAGC GTTCCTGCTG
ATGTGGTGGC TGTGTACCGG CCAGTACGCC AGGGCAACAA ACCACGCTTT CCCCCATCTT
TGGCGTCACG TGCCGCCCCG GCTAAAAATG ATCATTCCGC CCCGTCTCGC CGGATCGGTG
AAACGCCGCC TTCACCTGGC TGATATCACA GCCCCCCGGC CCCTGGAAAA TCAGCCAGAC
AACTCTCACC CGGATACCAC CGCCGAAGCA CATGTAACAT TGGATGTCCC GGAAAGACAT
TACGTCGACC TATCGTCCGA ACCGGTCGCG CATACACCGA TCAAGGCGAT AGCTTTCTAT
CTTCCCCAAT TTCACACCAT CCCGGAAAAC GATAAATGGT GGGGAGAAGG TTTCACCGAA
TGGACCAATA CCCGCAAAGG CAAACCCCTG TTTGACGGGC ACCATCAGCC ACGCGTGCCG
TTGCATCTCG GCTACTACAA CCTCGAAAAC ATCGAGGTGT ATGCCCAGCA AGTACAACTC
GCCAGGAAAG CCGGACTTTA TGGGTTCTGC TTCTATTTCT ATTGGTTTGG CGGCAAAACG
CTGCTAGAAA AACCGCTATT GAATATGCTT GCCAACCCGC AAATCGACCA GCCGTTTTGC
CTTTGCTGGG CCAATGAAAA TTGGACACGC CGATGGGATG GGCTCGATGA TGACATTCTA
ATAGCGCAAC ATCACAGCGA AGAAGACGCC ATTGCTTTCT TGCGCTACAT AAACACCTAT
TTTCGCGACG ACAGATATAT AAAAATAGAC GGAAAACCGT TACTCCTCGT CTATCGACCG
AGCATCATAC CCGACATCAC CCACATCCAA GAAGTGTGGA GAAAGCATGC CACCGAACTC
GGGTGGCCCG GCATTTATCT CGTCTCCGCG CAGACCTTCG GCCAAAAGGA TCCGCGCGAC
TTCCACTTCG ACGCCGCGGT CCAGTTTCCG CCCCAACATG CTGCACCTTG CGCAGCATTC
CATTCCGAAA CACCCAACCT AGCCGATGAT TTCGAAGGCT GTGTGTTCGA TTATCACAAC
GTCGCAACTC AATTCTGCGA CAGTCCTGAG GTCGACTACA AGCTGTTCCG AAGCGTCACA
CTCGCCTGGG ACAACTCCGC CAGGCGCGGG AAACGGGCGA CGATCCTGAG AAATTTCAGT
CTGACCAGCT ACGCGCAATG GCTTTTGACA GCCTGCAAGG CAACACTCGC CGACCACAAT
CTTACTGAAA ACGAAAGGCT CGTCTTTATC AATTGCATGG AACGAATGGG GCGAGGGCAC
TTATCTGGAG CCGGACACCA AATATGGCTT TGGATACCTG GAAGCCACAA AAAAGGCGCT
GAATGCATCC TCGGATGGAA AACCCAGGCT ATCCGTAATC GTTCCAAACT ATAA
 
Protein sequence
MNNTPVEPVR LTPDNPLVSV VIRTKDRPKR LAEAVRSVRE QTHRPLEVIV VNDGGWPLPE 
EALKEQAGDV ELILIQLEQN QGRSIAANTG LQAATGRYLC FLDDDDRFLP DHIELLSSCL
EHIEHRVCYS DAELCWQVYN VETGEFDIVN RQVFGSRDFN LAELLCGNYI PLNTLLFDRQ
VLLQVGGFDP QFDIYEDWDL LLRVGSLYPF YHLPRVTAQY NQWSKDHQNF YNHFSQTGQA
PLLHITGYDR LIEKNRHLFS ADAARHLLSV INRLRAIEAE LAQRDAVPAE SPLTRLTAQL
EQTLATIGPS FATLEGRLAD LDACQRDALV SATSKLNEGV SDLDGLRSQV AHLCMEFENA
SGKIDTSQAE LQKTATLLVQ TQNLLEDYRK EIARTTELNR LISERDSELR TLRQTIARNE
AEITALHKER NEANATISAL DEKVAHAETE KASLHQIIAQ TEARLASVYG STSWRIAAPL
RAVSVAVRWL LRNTRRAFLL MWWLCTGQYA RATNHAFPHL WRHVPPRLKM IIPPRLAGSV
KRRLHLADIT APRPLENQPD NSHPDTTAEA HVTLDVPERH YVDLSSEPVA HTPIKAIAFY
LPQFHTIPEN DKWWGEGFTE WTNTRKGKPL FDGHHQPRVP LHLGYYNLEN IEVYAQQVQL
ARKAGLYGFC FYFYWFGGKT LLEKPLLNML ANPQIDQPFC LCWANENWTR RWDGLDDDIL
IAQHHSEEDA IAFLRYINTY FRDDRYIKID GKPLLLVYRP SIIPDITHIQ EVWRKHATEL
GWPGIYLVSA QTFGQKDPRD FHFDAAVQFP PQHAAPCAAF HSETPNLADD FEGCVFDYHN
VATQFCDSPE VDYKLFRSVT LAWDNSARRG KRATILRNFS LTSYAQWLLT ACKATLADHN
LTENERLVFI NCMERMGRGH LSGAGHQIWL WIPGSHKKGA ECILGWKTQA IRNRSKL
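
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The sketch below shows one way to strip that formatting, compute GC content, and translate the DNA. It assumes that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in the permitted start codons, which this sketch does not model). The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAATAATA CTCCTGTTGA ACCAGTTCGC"
print(translate(first_blocks))  # MNNTPVEPVR
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the record.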