Gene CA2559_07585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_07585 
Symbol 
ID9297013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp1685934 
End bp1687073 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content32% 
IMG OID 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_003716276 
Protein GI298208097 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.547919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACAAC TCTCTGTAAT CATACTTAAT TACAATGTCA AGCATTTTTT AAAGCTTTGC 
CTACAAAGTG TGGTGCAGGC TAAAGAAAAT ATACAAGCCG AAATTATTGT TGCAGACAAC
GCATCTAAAG ATGGAAGTAT GGAAATGGTT GCTCAGGATT TTCCAAATGT TATAAGACTC
GAAAATAAAG AAAACTTAGG GTTTAGTAAA GCTAATAATC TAGCTGTAAA AAAAGCTAAA
GGTAAATACA TCTGTATCTT AAATCCAGAT ACTGTAGTGC CTGAGCAAAT CTTCTCAAAT
TTATTGAAGT TCGTTAAAAC AGTTCAAGAT TTTGGAGCGG TAGGAGTAAA GTTAATCGAT
GGTAAAGGGC AATTTTTACC AGAAAGCAAA CGGCAAATAC CTACTCCTAA AGTGGCCTTT
CAAAAAATGG TAGGCAACGC CACTAACTAT TATGCAAGTA ATTTAGAATC TAATGATATT
GGTTGTGTAG ATGTTCTTGT AGGCGCATTT ATGTTTATGT CCAGACAGCG TTATTTACAA
GTTGGAGGTT TTGATGAAGA CTACTTTATG TATGGTGAAG ATATAGACCT TAGCTACAAA
CTACTTAAGT CTGGTTATAA AAACTACTAT TACGGAAAAG ATTCAGTAAT TCATTTTAAA
GGAGAAAGTA CTACTAAAGA TGAAGTGTAT AGAGCACGTT TTTATGGTGC AATGCAACTT
TTTTATAAAA AACATTTTAG CAATAGTAAG TTTACTAATC TAATTGTAAA AGCTGCACTC
AAAGTAGTTA AGAAGGCAAA TAAGGCTCAA GGTTTAGATA CAGATAAGGA AATATCTTCA
AATTTATTTA TCTATATCGG TAATTCTTCA GACTGTGTTG CCATCTTGTC TAAACTAAAA
AATAAGCAAG TACAACATCT TTCTTTAAAA GAGCTACAGA AGCTAACTTT AAAAAATTCA
CAGCTGTTTT TAGACTCTCA ATTTTTTAAT TTCAAAGAAA TTATTAGTCT TCTAGAGCAG
TATGGGCACC ATAATAATAC ATTTAGGATA AAGTTGAAAT CTTCTAATGT GTTGATTGGT
AGTGATACTA GCACAGGAAA AGGTGAGGTT TTGGTTTTAG AACTAGATAA AATTCAATAA
 
Protein sequence
MVQLSVIILN YNVKHFLKLC LQSVVQAKEN IQAEIIVADN ASKDGSMEMV AQDFPNVIRL 
ENKENLGFSK ANNLAVKKAK GKYICILNPD TVVPEQIFSN LLKFVKTVQD FGAVGVKLID
GKGQFLPESK RQIPTPKVAF QKMVGNATNY YASNLESNDI GCVDVLVGAF MFMSRQRYLQ
VGGFDEDYFM YGEDIDLSYK LLKSGYKNYY YGKDSVIHFK GESTTKDEVY RARFYGAMQL
FYKKHFSNSK FTNLIVKAAL KVVKKANKAQ GLDTDKEISS NLFIYIGNSS DCVAILSKLK
NKQVQHLSLK ELQKLTLKNS QLFLDSQFFN FKEIISLLEQ YGHHNNTFRI KLKSSNVLIG
SDTSTGKGEV LVLELDKIQ