Gene CA2559_12043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_12043 
Symbol 
ID9297898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp2618105 
End bp2619748 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content34% 
IMG OID 
Productsodium iodide symporter 
Protein accessionYP_003717150 
Protein GI298208971 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.48339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTTACG GTGCCTATAA AACTAAGGGT AGTAAGAATG TACAAGATTA TATAAAAGGT 
AATAATGAAG CACAATGGTG GACCATAGGT TTATCTGTCA TGGCTACACA AGCAAGTGCC
ATCACATTTT TATCTACACC AGGACAAGCC TTTCATAGCG GTATGGGTTT CGTTCAATTT
TATTTTGGTT TACCTATCGC CATGGTAATT ATTTGCTTGG TGTTTATTCC TATTTACCAT
CGCTTAAAGG TATACACGGC TTATGAATAT TTAGAAAGTA GGTTTGATCA AAAAACCAGA
ACGCTTACAG CAATACTATT TCTTGTACAG CGTGGTTTAG CAGCAGGTAT AACCATATTT
GCTCCTGCAA TAATTTTATC TGCAGTATTA GGTTGGGATT TACTCACGCT TAATATTATT
ATTGGTGTTC TTGTTATTAT TTATACTGTA TCTGGTGGAA CTAAAGCTGT AAGTATTACA
CAAAAACAGC AAATGGCCGT AATCTTTGCA GGTATGTTTG CTGCCTTCTT TATTATTGTA
AGCAAACTAC CAGAAGATAT TACTTTTACT AAAGCCTTAG ATATTGCTGG TGCAAGTGGA
AAAATGGAGA TCCTCGATTT TTCTTTTGAT CTTAGTAATA GATATACTAT TTGGACAGGA
TTTTTAGGAG GTACCTTTTT AATGTTATCT TATTTTGGGA CAGACCAAAG CCAAGTACAG
CGATATTTGT CTGGTCGATC TGTTCGTGAA AGCCAATTAG GACTATTATT TAATGGTCTT
CTTAAAGTAC CAATGCAATT CTTTATTCTA TTGGTTGGTG TTATGGTATT TGTGTTTTAT
CAATTTAACG CATCTCCTAT AAACTTTAAT CCAGCTGCTC ATGAAGCTGT TCAGAATTCT
GAATATGTAC AAGAATACAC TGCACTTGAA AATCAATTAA AAACAATTCA AGCAGAACAG
AATATTACCA GTTTGGCGTA TGCAGAAGTT TCAGATCAAA CTAGTTCAGA AGACTACAAG
GCGTTAAAAT CTCAATTGGC TCAATTAAAC AAAGAAGAAG TTGCCGTACG CGAAAAAGCA
AAAACAATAA TCACGAGTGC AGACGCTACA ATTGAAACTA ATGACAAGGA TTATGTTTTT
ATAAATTTTA TACTAAACAA TCTTCCAAGA GGTCTTATAG GCTTGCTTTT GGCTGTAATT
TTATCTGCTG CTATGAGTAG CACGGCATCA GAATTAAATG CATTGGCATC TACCACAGCT
ATGGATTTGT ATAAGCGTAA CGTTACTACA GAAAAAAATG ACATGCATTT TGTGAAAGCC
TCTAAATGGT TCACATTAGG TTGGGGAGTT TTAGCCATAT TAGTGGCTTG TGTCGCAAAT
TTATTTGACA ATCTTATACA GCTCGTAAAT ATTATAGGTT CAATATTTTA TGGAAATATT
CTTGGTATCT TCCTACTTGC CTTTTTTGTA AAGTATGTAA AAAGCAAGGC AACATTTGTA
GCTGCCATAC TTACACAAGC AATTATTGTG TTTGTTTGGT ATATGGATTA CCTGCCTTAC
CTATGGCTTA ATGTTTTGGG TTGTGGTATT GTAATGGCAA TTGCTATCCT ATTGCAAACA
ACTTTTAAAG CTAAAGAACA TTAA
 
Protein sequence
MAYGAYKTKG SKNVQDYIKG NNEAQWWTIG LSVMATQASA ITFLSTPGQA FHSGMGFVQF 
YFGLPIAMVI ICLVFIPIYH RLKVYTAYEY LESRFDQKTR TLTAILFLVQ RGLAAGITIF
APAIILSAVL GWDLLTLNII IGVLVIIYTV SGGTKAVSIT QKQQMAVIFA GMFAAFFIIV
SKLPEDITFT KALDIAGASG KMEILDFSFD LSNRYTIWTG FLGGTFLMLS YFGTDQSQVQ
RYLSGRSVRE SQLGLLFNGL LKVPMQFFIL LVGVMVFVFY QFNASPINFN PAAHEAVQNS
EYVQEYTALE NQLKTIQAEQ NITSLAYAEV SDQTSSEDYK ALKSQLAQLN KEEVAVREKA
KTIITSADAT IETNDKDYVF INFILNNLPR GLIGLLLAVI LSAAMSSTAS ELNALASTTA
MDLYKRNVTT EKNDMHFVKA SKWFTLGWGV LAILVACVAN LFDNLIQLVN IIGSIFYGNI
LGIFLLAFFV KYVKSKATFV AAILTQAIIV FVWYMDYLPY LWLNVLGCGI VMAIAILLQT
TFKAKEH