Gene TM1040_2488 details


Gene Information

Locus tag: TM1040_2488
Symbol:
ID: 4076853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2628115
End bp: 2629692
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 57%
IMG OID: 638007812
Product: glycosyl transferase family protein
Protein accession: YP_614482
Protein GI: 99082328
COG category:
COG ID:
TIGRFAM ID:
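
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the coding sequence ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 2628115
end_bp = 2629692

gene_length = end_bp - start_bp + 1   # both endpoints count
codons = gene_length // 3             # includes the terminal stop codon
protein_length = codons - 1           # the stop codon encodes no residue

print(gene_length, protein_length)    # prints "1578 525"
```

Both results match the Gene Length and Protein Length fields.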


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.80194
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.91417
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAA TCGCCTATAT TCTGCTCTGT CACAAGGATC CGGAGGCCAT CATTCAGCAG 
GCCAACCGGC TCACTGCGAC CGGCGATTTT ATGGCGATCC ACTTTGATGC CCGTGCCAAG
ACCCGCGACT ATCGTGCTAT CCGCTCAGCG CTTTCAGACA ATCCCAACGT GACGTTCGCC
AAGCGTCGGG TGAAATGTGG TTGGGGCGAA TGGTCCTTGG TGCGCGCCAC GCTCAATGCG
CTGGAGGCAG CTGTCGACGA GTTCCCGCGC GCAACGCATT TTTATATGCT GTCGGGGGAT
TGCATGGCGA TCAAGACGGC GCAATACGCG CGCGCCTTTC TCGATCAGCA CGACAAGGAT
TTCATCGAGA GCTTTGATTT CTTCGAGAGC GACTGGATCA AGACCGGGAT GAAGGAGGAC
CGGCTGATCT ATCGCCACTA CTTTAACGAA CGCACCCAGA AACGCCTGTT CTATGCCGCG
TTCGAGCTTC AGAAGAAGCT CAAACTGACG CGGGAGGTGC CCGCCGATAT TCAGGTCCAG
ATCGGCAGCC AATGGTGGTG CCTGCGCCGA CGCACCGTCG AAGCTGTGCT TGCGATGACA
CGCAAACGCC GCGACGTGAT GCGCTTTTTT GCCTCGACCT GGATTCCGGA TGAGACGTTT
TTTCAGACGC TTGTGCGCCA CCTCATTCCT GAAGATGAGA TCGAAAGTCG CACACTTACG
TTTTTAATGT TCAGCGACTA CGGCATGCCG GTGAATTTTT ATAACGATCA CTATGATCTG
TTGCTGGGGC AGGATTTCCT GTTTGCGCGC AAAATCAGTC CTGATGCAAA AGAACTGAAA
ACGCGCCTCG GGCGTCTGTA TGCCGCGCGC GATGTGGAGT TCAAAATTTC CAACGAGGGG
CGCAACCTTT ACAAGTTTCT GTCCGAGCGC GGGCGCACCG GACAGCGTTT TGCACCGCGC
TTCTGGGAAA CCGAGAGCAG CCTCGGGCGC GAGCGTGAAT TGTTGATCCT CACCTGCAAG
AAATGGCATG TGGCCAAACG TATGCTGGAG CAGATCCGCA CGCTCACCAA TACGCCCGCG
ATCGAGTATC TTTTCCACGA AGAAGGCACG CCCCTGCCCG ATCTTGGCGG CATCCAGCGC
ACCCTCGCCA AACGCACCCG TCATCGGCGC GCCCTGGTGC GGATGCTGTT TGACTATTAC
GAGACCGACC GGCTGATTAT CTGCCTTGAT CCGTCCGCGC TCGAACTGAT GCATGATTTC
TATTCAGACC GGTCCCACAC GCGGCTCCTG CGGATCGACT GCGATTTTTC GGACAGCTAC
CTCATTGGCC ACGCGCATCG GGTCGGGCTC GCCGGTGAAC ATGCGGCCAA GGCAACGCTT
GAGCGGCTGC TGCCCGCAAT CCGCAACGAC ATCAGCAATG AAATTGACCA AATCCGCGAT
GCGGGTTTTG TGCGCCATTG GACGGTCGCG GAACGCGGCC CCGAGAGCGA CAACGCTCTT
GCGGTTTCAC AGTTTCTGGA CGTGCCGGTT GAGACCGCCC TTGAGGTGGT GCGCACGCCC
TATCTGTTCG CCGACTAG
 
Protein sequence
MAKIAYILLC HKDPEAIIQQ ANRLTATGDF MAIHFDARAK TRDYRAIRSA LSDNPNVTFA 
KRRVKCGWGE WSLVRATLNA LEAAVDEFPR ATHFYMLSGD CMAIKTAQYA RAFLDQHDKD
FIESFDFFES DWIKTGMKED RLIYRHYFNE RTQKRLFYAA FELQKKLKLT REVPADIQVQ
IGSQWWCLRR RTVEAVLAMT RKRRDVMRFF ASTWIPDETF FQTLVRHLIP EDEIESRTLT
FLMFSDYGMP VNFYNDHYDL LLGQDFLFAR KISPDAKELK TRLGRLYAAR DVEFKISNEG
RNLYKFLSER GRTGQRFAPR FWETESSLGR ERELLILTCK KWHVAKRMLE QIRTLTNTPA
IEYLFHEEGT PLPDLGGIQR TLAKRTRHRR ALVRMLFDYY ETDRLIICLD PSALELMHDF
YSDRSHTRLL RIDCDFSDSY LIGHAHRVGL AGEHAAKATL ERLLPAIRND ISNEIDQIRD
AGFVRHWTVA ERGPESDNAL AVSQFLDVPV ETALEVVRTP YLFAD
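
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the standard codon assignments, which translation table 11 shares with the standard code for elongation codons; the two tables differ in which codons may act as start codons. The helper name `translate` and the choice to check only the first and last sequence blocks are illustrative assumptions, not part of the record.

```python
# Build the codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Hypothetical helper: whitespace (such as the 10-base grouping used in
    the record) is removed before translation.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 18 bases of the gene sequence above.
first_block = "ATGGCAAAAA TCGCCTATAT TCTGCTCTGT"
last_block = "TATCTGTTCG CCGACTAG"

print(translate(first_block))  # MAKIAYILLC
print(translate(last_block))   # YLFAD
```

The two outputs correspond to the first ten and last five residues of the protein sequence listed above. The final block is in frame because the 1560 bases that precede it are a multiple of three.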