Gene CHU_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1047 
Symbol 
ID4184383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp1207878 
End bp1209344 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content40% 
IMG OID638071045 
Productsodium/solute symporter 
Protein accessionYP_677664 
Protein GI110637457 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.322443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.451489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCTC AATTAATTAT TACAATATTA GGAGTGTATT TCGCTGCTTT ACTGCTGATT 
GCCTGGTTTA CGTCCAGAAA TGCAGATTCA GATGCTTTTT TTACAGGAAA TAATCAATCT
GCCTGGTATC TGGTAGCTTT TGGTATGATC GGCACTTCGG TTTCAGGAGT AACATTCATT
TCCGTACCGG GACAGGTTGC CGCACAGGGT TTTTCGTATT TTCAGCTCAT ATTGGGCAAT
ATGTTTGGAT ATTTTGTAGT AGCAGCCGTT TTGATGCCTA TTTACTATAA ATCGAACATG
GTTTCCATTT ATACCTTTTT AGAAGAACGG TTTGGATTCT GGTCCTACAA AACCGGTTCT
GCATTTTTCT TATTATCCCG TACAATCGGG TCTTCGCTTC GCCTGTATCT GGCAGCAGAA
GTATTACATA CGTTTCTGTT CCGTGAGCTG GGTGTTTCAT TTATTGTTAC GGTTGGTGTT
ACCATTCTGC TGATCTGGGT GTATACATTT AAGGGCGGCG TTAAAACCAT TATCTGGACA
GACACCTTTC AAACGTTCTT TCTGGTAGGT GCTGTTATCA TCAGTGTTGT TGTTATTTCA
AACCAGTTAG GCTGGGGTAC GGTTGAAATG ATCAAAGAAG TAGATGCAAG TAAATATTCT
ACCATCTTTC ACTTTGAAGA CATAAAATCC CCGCAGTATT TCTGGAAACA GTTTATATCG
GGCATTTTTA TGACGATCGT GCTGACCGGT CTGGATCAGG ATCTGATGCA AAAAAATCTT
ACCTGTAAAA ATCTGGGTGA AGCACAAAAG AATATGTACT GGTTCTCTGT GATACTGGTA
GCAGTAAATT TTTTATTCTT AACCTTAGGT GCCTTGCTCT ATATCTATGC AGATCAAAAG
GGGATAGCAG TTCCTGCACA ATCCGATTTC TTTTATCCGA TTCTTGCCTT AAAATATCTG
GGTGTAATTG CCGGCGTATT CTTTTTGCTG GGAATAACGG CTTCTTCGTA TGCCAGTTCT
GACTCAGCAC TGACAGCGCT GACAACCGCA TTCTGTATTG ACTTCCTCAA TTTTAATAAA
GGCAATGTTG TAAATAAAAA CCGCACACGT ACTTATGTGC ACATAGGTTT TTCTATGCTC
TTTTTTGTGA TTATTGTTTT GTTCAAAGAA TTCAATGAAG GTACAACGGT TATAAAAACA
GTATTAAAAG CGGCCGCTTA TACTTATGGT CCGTTGCTGG GCATGTTTGC CTTTGGTATC
TTCAGCAAGC ACAGAACAGT TACCGATAGA TGGGTTCCTG TTGTATGTAT AGTATCGCCG
CTGCTGACAT TCCTCGTTGT GCTGTTTATC AAAGAGGTAC TCGGGTATCA GACAGCCTTT
GAGGACTTGC TCATCAATGG CGCCATAACG ATTATAGGCT TGTTGTGTAT TTCACATGCG
CCTAAACAAC GCGATGCATT TTCATAA
 
Protein sequence
MTAQLIITIL GVYFAALLLI AWFTSRNADS DAFFTGNNQS AWYLVAFGMI GTSVSGVTFI 
SVPGQVAAQG FSYFQLILGN MFGYFVVAAV LMPIYYKSNM VSIYTFLEER FGFWSYKTGS
AFFLLSRTIG SSLRLYLAAE VLHTFLFREL GVSFIVTVGV TILLIWVYTF KGGVKTIIWT
DTFQTFFLVG AVIISVVVIS NQLGWGTVEM IKEVDASKYS TIFHFEDIKS PQYFWKQFIS
GIFMTIVLTG LDQDLMQKNL TCKNLGEAQK NMYWFSVILV AVNFLFLTLG ALLYIYADQK
GIAVPAQSDF FYPILALKYL GVIAGVFFLL GITASSYASS DSALTALTTA FCIDFLNFNK
GNVVNKNRTR TYVHIGFSML FFVIIVLFKE FNEGTTVIKT VLKAAAYTYG PLLGMFAFGI
FSKHRTVTDR WVPVVCIVSP LLTFLVVLFI KEVLGYQTAF EDLLINGAIT IIGLLCISHA
PKQRDAFS