Gene Meso_1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMeso_1154 
Symbol 
ID4180745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChelativorans sp. BNC1 
KingdomBacteria 
Replicon accessionNC_008254 
Strand
Start bp1262120 
End bp1263652 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content60% 
IMG OID638067039 
Productsugar transferase 
Protein accessionYP_673715 
Protein GI110633507 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAACG CCGAACGAAA GCATCGTTTT ACGCCCGAGG GCGGGCCGAC CGAGGAGCCG 
CATCAGCGGC GCCCACGCCC CGTGCTGAGC AAAATTGCAC GGCAGGTGGC CCAGCAATAT
CGTCGCGACA CCATGTCTCC CGTCATGGTG AGCGGTGTCA TGCGCCTCAT CGAATTTGGG
GCACTGCTTG TTGCGGGGCT TCAAATCTAT GCGGTGCATG TGGGGCTCGG CACTCACCTT
TCCTGGCATT ATCCCATCGT CATCCTCGCC GTGTCGCTTC TGGCAGTCAT CCTTATGGAG
TTCTCCGACT GCTACCAGAT GCCGGCGCTT CGCAACCCCG TCACACAGGC AGGACGCATC
CTGCTCATCT GGTCGGCGAC CTTCGCGCTG CTTACGCTCG CGCTCTTCTT CCTGAAGATT
TCCGAGGAAT TTTCGCGGTT CTGGCTCGGC ATCTGGTATC TCAGCGGGCT GGCACTTCTG
CTTGTCCTTC GCCTTATCGC GTCGCGGCTC ATCCGAAGAT GGGCGCGCAA CGGGCGGATG
GAACGGCGTG CTGTGCTTGT GGGCGGCGGC AAGAATGCCG AACACCTTAT CCGCTCCATC
GAGCAACAGC CCTATAATGA TATCCGCATC TGCGGCATCT TCGACGACAG GGATGAAACC
CGCTCCCCGC CGGTCGTTGC GGGCTATCCG AAACTCGGCA ATATCGATGA GCTCATCGAA
TTTGCACGCA TTGCACGCAT CGACATGCTG ATCGTCTCTC TGCCGATCAC TGCCGAGGCA
CGCGTGCTAA CGCTGTTACG CAAGCTCTGG GTGCTGCCGA TCGATATCCG GCTCTCGGCC
CACTCGACCC ATCTGCGGTT CCGGCCCCGC GCCTATTCCT ATATCGGCGC GGTGCCGATG
CTCGATATTT TCGACAGGCC GATCAACGAC TGGGATTCGG TCGCCAAGCG CGCCTTCGAT
ATCGTTTTCA GCCTGATGGG CATCATCGTC TTTTCGCCGG TAATGCTTGC AACGGCGATC
GCCATCAAGC TCGACAGCAA GGGCCCAGTG ATCTTCAAGC AGCGGCGGCA TGGCTTCAAC
AACGAGGAGA TCGAGGTCTA CAAATTCCGC TCCATGTATG TGGAAGCCAG CGATCCCACC
GCGCGAAAGC CCGTCACCAG GGGCGACCCG CGCGTGACGC GCGTCGGCCG GTTCATTCGC
AAGACTTCGA TCGATGAGCT GCCGCAATTC TTCAACGCGC TCTTCGGCAG CCTGTCCTTG
GTAGGACCTC GGCCGCATGC GGTGGCGGCG GAGGCGCATA ACCGGTTGTT CGACGAGGTG
GTGGACGGCT ATTTCGCCCG GCACCGCGTG AAACCAGGCG TGACCGGATG GGCGCAGATA
AACGGCTGGC GCGGTGAGCT CGATACGGAG GAGAAAATCC GCAAGCGTAT CGAATACGAC
CTTTACTACA TCGAAAACTG GTCGCTCTGG TTCGATCTCA AGATCCTGCT CCTGACGCCG
ATTCGCCTTC TGGATACGAG AAACGCCTAT TGA
 
Protein sequence
MNNAERKHRF TPEGGPTEEP HQRRPRPVLS KIARQVAQQY RRDTMSPVMV SGVMRLIEFG 
ALLVAGLQIY AVHVGLGTHL SWHYPIVILA VSLLAVILME FSDCYQMPAL RNPVTQAGRI
LLIWSATFAL LTLALFFLKI SEEFSRFWLG IWYLSGLALL LVLRLIASRL IRRWARNGRM
ERRAVLVGGG KNAEHLIRSI EQQPYNDIRI CGIFDDRDET RSPPVVAGYP KLGNIDELIE
FARIARIDML IVSLPITAEA RVLTLLRKLW VLPIDIRLSA HSTHLRFRPR AYSYIGAVPM
LDIFDRPIND WDSVAKRAFD IVFSLMGIIV FSPVMLATAI AIKLDSKGPV IFKQRRHGFN
NEEIEVYKFR SMYVEASDPT ARKPVTRGDP RVTRVGRFIR KTSIDELPQF FNALFGSLSL
VGPRPHAVAA EAHNRLFDEV VDGYFARHRV KPGVTGWAQI NGWRGELDTE EKIRKRIEYD
LYYIENWSLW FDLKILLLTP IRLLDTRNAY