Gene Acid345_3923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3923 
Symbol 
ID4071306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4638796 
End bp4640052 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content62% 
IMG OID637985949 
Productmajor facilitator transporter 
Protein accessionYP_592997 
Protein GI94970949 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.768461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.191124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGATA AGTCCCTGCC TAATCGCTGG TTGATTGCCG TTGCCGCTGT GATCATGCAG 
ATTTGCCTGG GCGCTGTGTA CGGCTGGAGC GTTTTCGTGA AGCCTTTGGT CGGCGGTGAG
CACTGGACGC TGACCGAGGT TTCTCTGAAC TTCACCATCG CGCTGGCGTT CCTCGGCGTG
GGCACGGTGA TCGGTGGCTT GTGGCTGGAT CGCGTCGGTC CGCGCATGGT TGCGACGGTC
GCCGGCGTGC TCTACGGCAT TGGATACATC GTGGCGAGCA TCGGCGTAAA GAACCACTCG
CTTACGGTGC TCTACATCGG TTACGGCGTG TTAAGCGGCG TCGGCATGGG CATGGGCTAC
ATCTGCCCGG TGGCGACGAT CGCCAAGTGG TTCCCCGATC TCCGCGGCTT GATGACCGGC
GTCGCGGTTG CGGGGTACGG TGCCGGCGCG CTCATCATGA GCCCGATCGC GGCGAAGTTG
ATCGTGTCAC GGGGGATTCC TTACACATTC CTCGGCATGG GTATTGTGTA CGGGGTGCTG
GTGATCCTTA CCGCGCAGGC ATACGAAAAT CCGCCGGAGG GCTGGCGTCC GGTCGGCTGG
CAGCCAACTT CTGCGGTGTC GAAATCGGCG ACGACCGAGA CCTTCACCGT CGCCGAAGCA
ATGCGCACGT GGCAATTTTG GCTGCTGTTT GCCATGCTCT TCCTCAACAC CTCTGCGGGA
ATCATGATCA TCAGCCAAGC CTCGCCTATG GCACAGCAGA TCGTGGGGCT CACAGCGATT
TCCGCGGCCG GAATCGTCGG CTTAATCTCG ATCTTTAACG CCGCAGGCCG CGTGTTCTGG
GCATGGATGT CGGACCTCAT CGGACGTGGC ACGGTGTACT TCCTGCTCTT CGCGATCCAG
GCGGTGATTT TCTTCGCGCT GCCGCACCTT ACCACGCGGG CACTTTTCGC TACCGCCGTG
GCGATCGTGG GTCTCTGCTA TGGAGGCGGC TTCGGTACCA TGCCGTCGTT CACCGCCGAC
TTCTTCGGCG CGAAATTTAT GGGTGGGATC TACGGATGGA TCCTGCTGGC GTGGGGCGCA
GCAGCCATCC CGTCGCCGCT GATGATTGCG CACATCCGCC AGACCACGGG GCAGTATCGC
CAGGCGATTT ATGTGATCGG GATCGTGATG GTCGTGTCGC TGGTGCTGCC GATCTTGGCG
CGGAAGCGAC CACGCAAGGG AGCGGTGGTG CCCATCAACC AGTCGCGCGC GGCGTAA
 
Protein sequence
MPDKSLPNRW LIAVAAVIMQ ICLGAVYGWS VFVKPLVGGE HWTLTEVSLN FTIALAFLGV 
GTVIGGLWLD RVGPRMVATV AGVLYGIGYI VASIGVKNHS LTVLYIGYGV LSGVGMGMGY
ICPVATIAKW FPDLRGLMTG VAVAGYGAGA LIMSPIAAKL IVSRGIPYTF LGMGIVYGVL
VILTAQAYEN PPEGWRPVGW QPTSAVSKSA TTETFTVAEA MRTWQFWLLF AMLFLNTSAG
IMIISQASPM AQQIVGLTAI SAAGIVGLIS IFNAAGRVFW AWMSDLIGRG TVYFLLFAIQ
AVIFFALPHL TTRALFATAV AIVGLCYGGG FGTMPSFTAD FFGAKFMGGI YGWILLAWGA
AAIPSPLMIA HIRQTTGQYR QAIYVIGIVM VVSLVLPILA RKRPRKGAVV PINQSRAA