Gene CA2559_02110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_02110 
Symbol 
ID9295907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp513084 
End bp514679 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content33% 
IMG OID 
ProductSulfate transporter 
Protein accessionYP_003715188 
Protein GI298207009 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAGA ATTTAAAGCA CGATTTTCCA GCTAGTGTCG TGGTATTTTT TGTTGCTATT 
CCTCTTTGTC TTGGTATAGC ATTAGCTAGT GGAGCACCAC TATTTTCAGG CTTAATTGCA
GGTATTATCG GTGGTATTGT AGTTGGAGCC ATTAGTGGTT CTAGTTTAGG GGTAAGTGGT
CCTGCAGCAG GTCTTGCCGC TATTGTACTA GCTGCAATTG CAAGTTTAGG GAGTTATGAA
AACTTCTTAG TAGCTGTAGT TTTAGGTGGA GCAATTCAAA TTGTTTTTGG TCTTCTAAAA
GGTGGTATAA TAGGATATTA CTTTCCATCA TCCGTAATTA AGGGTATGCT AACAGGTATT
GGTATTATCA TTGTACTTAA GCAAATTCCT CATTTCTTTG GTTATGATGC AGATCCAGAA
GGAGATTGGG CATTTTTACA AATGGACGGT GAAAATACAT TTTCTGAATT AGGAAATATT
ATTGGTAATA TAAGTCCTGG CGCTACTTTA ATAGCAGTTA TTGCTATGGT GATTTTAATA
TTATGGGAAG CAGTTTTATC TAAAAAAGGT AAAATATTTC AATTAGTACA AGGACCTTTA
GTAGCAGTAT TTGCAGGTAT TGTTTATTTT GTAGCTACTC AAGATATTTC AGGTTGGAGT
ATTAGTCAAG AACACTTAGT GAGTGTTCCT ATTCCAGATG ATTTTGACTC ATTCCTAGGT
CAGTTTACCT TCCCAAATTT TGGTGTTATT GGTAATCCAG ATGTTTGGAT AACAGCATTT
ACAATTGCTT TAGTAGCAAG TTTAGAAACA CTACTTTGTG TTGAGGCTAC AGATAAGCTA
GACCCTAAGA AAAGAACAAC ACCTACTAAC AGAGAGTTAT TTGCACAAGG TACAGGTAAC
ATGATATCTG GTTTAATTGG TGGTCTACCA ATTACACAGG TAATTGTAAG AAGTTCGGCT
AACATACAGT CTGGTGGACA AACAAAATTA TCAGCAATCC TTCACGGTTT CTTATTATTA
ATATCTGTTA TATTAATCCC TAATATTCTA AACCTTATTC CTTTATCTGT ATTGGCAGCT
ATTCTATTCT TAGTAGGATA TAAATTAGCT AAGCCATCTA CCTTTAAAAG AATGTTTGTT
TTAGGTTGGA AACAATTTGT ACCATTTATA GTTACTGTAA TAGGTATTGT GTTTGGAGAT
TTATTAATTG GTATTAGTTT AGGACTTGCA ATTGGTATTG TAGTAATTGT AATAAAAAGC
TACCAAAACT CTCACTTTTT ACATATCGAA GAACCATCAA ATGGTAAAAA CCATGTTAAG
ATGACTCTTG CAGAAGAAGT TACTTTTATT AATAAAGGAG CAATTTTGAA AGAATTAAAC
AATCTTGAGA GTAATTCATT CTTAGAAATT GATACGAGAA AGGCTAAATA TTTAGATCAT
GATATTATTG AGATATTAGA CGATTTTAAA TTTAGAGCAG AAGAGCGTAA CATTCATATA
AAGATTATTT CTGAAAGAGG AATCACAGAA AATCCAGAAA GCTACGAGCA GTTCTTTAAT
CAAGAGAAAA AACAACCTTT TAGACGAGGA GCCTAA
 
Protein sequence
MFKNLKHDFP ASVVVFFVAI PLCLGIALAS GAPLFSGLIA GIIGGIVVGA ISGSSLGVSG 
PAAGLAAIVL AAIASLGSYE NFLVAVVLGG AIQIVFGLLK GGIIGYYFPS SVIKGMLTGI
GIIIVLKQIP HFFGYDADPE GDWAFLQMDG ENTFSELGNI IGNISPGATL IAVIAMVILI
LWEAVLSKKG KIFQLVQGPL VAVFAGIVYF VATQDISGWS ISQEHLVSVP IPDDFDSFLG
QFTFPNFGVI GNPDVWITAF TIALVASLET LLCVEATDKL DPKKRTTPTN RELFAQGTGN
MISGLIGGLP ITQVIVRSSA NIQSGGQTKL SAILHGFLLL ISVILIPNIL NLIPLSVLAA
ILFLVGYKLA KPSTFKRMFV LGWKQFVPFI VTVIGIVFGD LLIGISLGLA IGIVVIVIKS
YQNSHFLHIE EPSNGKNHVK MTLAEEVTFI NKGAILKELN NLESNSFLEI DTRKAKYLDH
DIIEILDDFK FRAEERNIHI KIISERGITE NPESYEQFFN QEKKQPFRRG A