Gene VC0395_0474 details


Gene Information

Locus tag: VC0395_0474
Symbol:
ID: 5134297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009456
Strand:
Start bp: 529190
End bp: 530641
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 48%
IMG OID: 640530797
Product: putative formate transporter 1
Protein accession: YP_001215315
Protein GI: 147671415
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG0517] FOG: CBS domain; [COG2116] Formate/nitrite family of transporters
TIGRFAM ID: [TIGR00790] formate/nitrite transporter
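
The coordinates and lengths in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(529190, 530641)
print(length)                  # 1452
print(protein_length(length))  # 483
```

Both results match the Gene Length and Protein Length fields above.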


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000188118
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGCAG CCTACTCTAA AAATCAAAAC TGCTTTTCAC CCACGGAAAT GATGGCAGAA 
GCAGAAAAGT TCGCACTGAG TAAAGCGAAA AAAACCAGCG GCATGATCTT AGGTCTTTCG
GTTATGGCGG GCGCTTTTAT CGGTTTGGCT TTTCTGTTCT ACATTACCGT CACCACAGGC
AGCGCTTCTG CCGGCTGGGG ATTGAGCCGA CTGGCGGGTG GTGTAGCGTT CAGTATGGGG
CTGATTTTGA TCGTCATCTG CGGTGGCGAG TTGTTCACAA GCTCTGTGCT ATCTAGCATC
TCATGGGCAA ATCGCGAAAT CAGTTTTGGA AAAATGCTCT CTATCTGGGG CAAGGTGTAC
GTCGGTAACT TTATCGGTGC CATTTTTCTA CTGCTTTTGG TGACAGCGGC TGGCCTTTAC
CAGCTTGATG AAGGCCAATG GGGTTTAAAT GCCCTCAATA TTGCGCAGCA CAAACTTCAT
CACACCACAG TACAAGCTTT TGCTTTAGGC ATTCTATGTA ACCTACTGGT TTGTTTGGCT
ATTTGGCTGA CCTTCAGTTC AGCGAATGCT ATGACTAAAG CGGCCATGAC CATCATGCCT
GTCGCGATGT TTGTTTCTAG CGGCTTTGAG CACTGTGTGG CCAATATGTT CATGGTTCCA
CTGGGTATTG TTATTCAAAA CTTCGCACCA GACAGTTTCT GGCAACAGGT TGGTGTGACA
GCCAGCCAAT ACAGCGATTT GAATGTCACT CAATTTATTA CGGCGAACTT AATACCGGTC
ACGCTCGGCA ACATTGTGGG TGGTGCCGTG CTGGTTGGCC TCGCCAACTG GAGCATTTAC
CGCCGCCCTC AGTTAAAAGC CGCCAATGTT GTCACGATTA CGGAAACTCA AGCACTTACG
TCAGTCAAGG AAACTCTTAT GAAAAGCACA ATTACAGTAA AAGATATGAT GAACACTCAA
CCTGTTACCC TCAGCGTTGA GATGACCACT CCAGCCGCGA TCGACACCCT ACTCGACCAC
CATTTGTCCG CTGCTCCAGT TGTCGATATG CAAGGTCGCT TGGTTGGTGT GCTCTCTAGT
CACGATGTAA TGGTTGATCT CTGGTGCCAA GACTACTTGC CAAGCCAAGA CCAAAAAGTG
GTAGATCTGA TGACTCGTGA TGTGATTGCG ATTGATATCA ACGACAAGCT GGTGGATGTT
GCGGAGTTCT TCTGTATCGA TAAAGAACAG CTATTCCCAA CCACAAGCAT GGGCATTGCC
ACTCGCTTCA ACGCTCTCTC ATTAGAAGAA CGCGCCAAAA GCATCAAGGT AAACAAACCA
CATATGCTGC CTGTTCTACA CAATGGTCAG TTAGTGGGAG TACTGGAGCG TAATGATGTG
CTTGAAGCGC TGCGCCCAAT TTATGGTGAA CGGGTAAGAA TTGTCAAAGA TAAAGCGTTG
GCTCGCGCTT AA
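
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before computing anything from it. The following is a minimal sketch of how the reported length and GC content could be rechecked; `normalize` and `gc_percent` are hypothetical helper names, and `sample` holds only the first and last displayed lines rather than the full sequence.

```python
def normalize(raw: str) -> str:
    """Remove spaces and line breaks between blocks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the displayed gene sequence, for illustration only.
sample = """
ATGTCTGCAG CCTACTCTAA AAATCAAAAC TGCTTTTCAC CCACGGAAAT GATGGCAGAA
GCTCGCGCTT AA
"""
seq = normalize(sample)
print(len(seq), seq[:3], seq[-3:])  # 72 ATG TAA
print(round(gc_percent(seq)))       # GC percentage of the sample only
```

Pasting the complete sequence into `sample` would allow a comparison against the 1452 bp and 48% values listed under Gene Information.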
 
Protein sequence
MSAAYSKNQN CFSPTEMMAE AEKFALSKAK KTSGMILGLS VMAGAFIGLA FLFYITVTTG 
SASAGWGLSR LAGGVAFSMG LILIVICGGE LFTSSVLSSI SWANREISFG KMLSIWGKVY
VGNFIGAIFL LLLVTAAGLY QLDEGQWGLN ALNIAQHKLH HTTVQAFALG ILCNLLVCLA
IWLTFSSANA MTKAAMTIMP VAMFVSSGFE HCVANMFMVP LGIVIQNFAP DSFWQQVGVT
ASQYSDLNVT QFITANLIPV TLGNIVGGAV LVGLANWSIY RRPQLKAANV VTITETQALT
SVKETLMKST ITVKDMMNTQ PVTLSVEMTT PAAIDTLLDH HLSAAPVVDM QGRLVGVLSS
HDVMVDLWCQ DYLPSQDQKV VDLMTRDVIA IDINDKLVDV AEFFCIDKEQ LFPTTSMGIA
TRFNALSLEE RAKSIKVNKP HMLPVLHNGQ LVGVLERNDV LEALRPIYGE RVRIVKDKAL
ARA
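
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes the amino-acid assignments of NCBI translation table 11 in the usual T, C, A, G codon ordering; `translate` is a hypothetical helper, and it does not handle the alternative start codons that table 11 permits (not needed here, since this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(raw: str) -> str:
    """Translate a DNA string; whitespace is ignored and '*' marks a stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        # str.index raises ValueError for any character other than T, C, A, G.
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


print(translate("ATGTCTGCAG CCTACTCTAA AAATCAAAAC"))  # MSAAYSKNQN
print(translate("GCTCGCGCTT AA"))                     # ARA*
```

The two outputs correspond to the first ten and the last three residues of the protein sequence shown above, followed by the stop codon.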