Gene SeD_A1622 details


Gene Information

Locus tag: SeD_A1622
Symbol:
ID: 6874134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1565923
End bp: 1567092
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 52%
IMG OID: 642784767
Product: tetratricopeptide repeat protein
Protein accession: YP_002215435
Protein GI: 198242977
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
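
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the gene record above.
start_bp = 1565923
end_bp = 1567092
gene_length_bp = 1170
protein_length_aa = 389

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last codon is assumed to be the stop.
codon_count = gene_length_bp // 3
residue_count = codon_count - 1

print(span_bp, codon_count, residue_count)
```

Under those assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.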


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000649442
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 2.66656e-22
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTTGGAGT TGTTATTTCT GCTGTTGCCT GTAGCCGCTG CCTATGGGTG GTATATGGGT 
CGCAGAAGTG CGCAACAAAC AAAACAGGAT GAAGCTAACC GCCTGTCGCG CGATTATGTC
GCAGGGGTTA ACTTCCTGCT GAGTAACCAA CAAGATAAAG CGGTGGATCT GTTCCTCGAT
ATGCTTAAAG AGGATACGGG CACCGTTGAG GCTCATCTCA CTCTCGGTAA TCTGTTTCGC
TCTCGCGGCG AAGTCGATCG CGCCATTCGT ATTCATCAAA CGCTCATGGA AAGCGCTTCA
TTGACCTATG AACAGCGTTT ACTGGCTGTT CAGCAACTGG GGCGCGACTA TATGGCCGCC
GGTTTATATG ACCGCGCGGA AGATATGTTT AACCAACTTA CCGACGAAAC GGAATTTCGC
GTAGGCGCGT TACAGCAGCT CTTGCAAATC TATCAGCTAA CCAGCGACTG GCAAAAGGCG
ATCGAAGTAG CAGAACGGCT GGTGAAGCTG GGCAAAGATA AACAACGTAT CGAAATCGCC
CATTTTTACT GTGAGTTAGC GTTACAGCAG ATGGGCAACG ACGACATGGA TCGCGCGATG
GCGTTGCTGA AAAAAGGTGC CGCCGCAGAT AAAAATAGCG CCCGGGTGTC TATCATGATG
GGGCGCGTTT ATATGGCGAG AGGGGATTAC GCCAAAGCGG TCGAAAGCCT GCAACGTGTG
ATCGTTCAGG ATAAAGAGCT GGTCAGCGAA ACGCTGGAGA TGCTGCAAAC CTGTTATCAA
CAGCTCGGTA AAAATGCCGA GTGGGCGGAG TTTTTACGTC GCGCCGTTGA GGAGAATACC
GGTGCTGGCG CTGAGTTAAT GCTTGCCGAT ATTCTGGAAG CGCGTGAAGG TAGTGACGCA
GCTCAAGTCT ATATCACGCG TCAGCTACAG CGACATCCTA CCATGCGGGT GTTCCATAAG
CTGATGGATT ACCATCTCAA CGAGGCGGAA GAAGGGCGAG CGAAAGAAAG CCTGATGGTA
CTGCGTGATA TGGTTGGCGA GCAGGTGCGC AGTAAACCGC GGTATCGTTG TCAGAAATGC
GGTTTTACCG CCTATACCTT GTACTGGCAC TGTCCGTCCT GCCGGGCATG GTCGACCATT
AAACCTATTC GCGGACTTGA TGGGCAGTAG
 
Protein sequence
MLELLFLLLP VAAAYGWYMG RRSAQQTKQD EANRLSRDYV AGVNFLLSNQ QDKAVDLFLD 
MLKEDTGTVE AHLTLGNLFR SRGEVDRAIR IHQTLMESAS LTYEQRLLAV QQLGRDYMAA
GLYDRAEDMF NQLTDETEFR VGALQQLLQI YQLTSDWQKA IEVAERLVKL GKDKQRIEIA
HFYCELALQQ MGNDDMDRAM ALLKKGAAAD KNSARVSIMM GRVYMARGDY AKAVESLQRV
IVQDKELVSE TLEMLQTCYQ QLGKNAEWAE FLRRAVEENT GAGAELMLAD ILEAREGSDA
AQVYITRQLQ RHPTMRVFHK LMDYHLNEAE EGRAKESLMV LRDMVGEQVR SKPRYRCQKC
GFTAYTLYWH CPSCRAWSTI KPIRGLDGQ
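
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is a minimal illustration, not the database's own method: `translate` and `gc_percent` are hypothetical helper names, and the codon table is built from the standard amino-acid assignment string, which translation table 11 shares with the standard code for elongation codons (the two tables differ in which start codons they permit). Only the first and last 30 bases of the gene sequence are used here.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout used for genetic code tables.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna):
    """Remove the spacing used in the record's 10-base blocks and upper-case."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last 30 bases of the gene sequence listed above.
first_block = "ATGTTGGAGT TGTTATTTCT GCTGTTGCCT"
last_block = "AAACCTATTC GCGGACTTGA TGGGCAGTAG"

print(translate(first_block))  # MLELLFLLLP
print(translate(last_block))   # KPIRGLDGQ
```

The two printed fragments correspond to the first ten and last nine residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.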