Gene CA2559_06520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_06520 
Symbol 
ID9296799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp1452629 
End bp1453729 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content35% 
IMG OID 
ProductN-acetylglucosaminyl transferase 
Protein accessionYP_003716063 
Protein GI298207884 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACT TAAAATTCAT ATTATCAGGT GGCGGAACTG GAGGACATAT ATATCCTGCA 
ATAGCAATTG CCAATGAATT GAAAAATCGT TACCCAGATG CAGAGTTTCT ATTTGTAGGA
GCTAAAGACC GCATGGAAAT GGAAAAAGTT CCAAACGCTG GTTACAATAT CAAGGGACTT
TGGATAAGTG GTATACAACG TAAACTCACC TTCACAAATC TTATGTTTCC ATTCAAACTA
TTGTCTAGTT TATGGAAAAG TAGAAGCATT ATAAAAAGAT TTAAACCTGA CGTGGTAATT
GGTACAGGAG GTTTTGCGAG TGGGCCATTG CTTAAAATGG CAAATAGCAA GAACATTCCT
ACCCTTATAC AAGAACAAAA TAGTTACGCA GGTATTACCA ATAAATGGTT AGCAGATAAG
GCTAACAAAA TATGTGTGGC TTATGACCAT ATGGAAAAGT ATTTTCCAGC AGAAAAAATT
ATAAAAACTG GCAATCCTGT TAGGCAAGAC ATTAAAGATC TTGATTCAAA AAGAGCAGAA
GGCATAGATC ATTTTGAATT AGATGAAACA AGAAAGACAG TATTAGTTCT CGGTGGAAGC
CTTGGTGCTA AGCGTATAAA TGAGTTAATA GCTAATCACG CTAAAGATTT TGAGGAAACA
GGTGTAAACG TTATTTGGCA AACTGGTAAG TTATACTATG AACAATATAA AACGCTTGAA
GAAAATAAAC GTTTACAAGT GAAGGAGTAT ATAAACCGAA TGGATCTAGC ATATAGTGTA
GCAGATATAA TTATTAGCCG TGCTGGTGCA GGATCTGTAA GTGAGCTTTG TATCGTAGGA
AAACCTGTGA TCTTAATTCC TTCTCCAAAC GTAGCAGAAA ATCATCAAAT GAAAAATGCT
ATGGCATTAG CTGTGGAAGA AGCTTGCTTA ATTATGAAAG AAAGCGAAAT GGAAGAGCAA
TTTAAAAGAC AATTTATAAA TCTTTTAGAA GATGAAGCAA TGCAAGCAAA GCTTTCAGAA
AATATAAAAA AACTAGCAAG GCCCAATGCA ACTAAAGATA TTGTAAACGA AATTGAACAT
TTAATTAATC ATACTGCGTA G
 
Protein sequence
MSNLKFILSG GGTGGHIYPA IAIANELKNR YPDAEFLFVG AKDRMEMEKV PNAGYNIKGL 
WISGIQRKLT FTNLMFPFKL LSSLWKSRSI IKRFKPDVVI GTGGFASGPL LKMANSKNIP
TLIQEQNSYA GITNKWLADK ANKICVAYDH MEKYFPAEKI IKTGNPVRQD IKDLDSKRAE
GIDHFELDET RKTVLVLGGS LGAKRINELI ANHAKDFEET GVNVIWQTGK LYYEQYKTLE
ENKRLQVKEY INRMDLAYSV ADIIISRAGA GSVSELCIVG KPVILIPSPN VAENHQMKNA
MALAVEEACL IMKESEMEEQ FKRQFINLLE DEAMQAKLSE NIKKLARPNA TKDIVNEIEH
LINHTA