Gene Hhal_0461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0461 
Symbol 
ID4711518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp527727 
End bp529391 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content71% 
IMG OID639854920 
ProductNa+/Pi-cotransporter 
Protein accessionYP_001002051 
Protein GI121997264 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.790471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGGGCA TGCAGGTCTT CACGTTCCTG GGCGGTATCG GCCTGTTTCT GCTGGGTATG 
CGGCTGATGA GCGACGGCCT GCGTGCTGCT GCCGGGGGCG CGCTGCGCGA TATCCTTGCC
GCCTCGACGC GCTCGCGGTT GCGGGGGCTG TTCTCCGGGG TGTTGATTAC CGCCGCGGTG
CAGTCGTCGA GCGCGGTGAT CTTCGCCACC GTCGGCTTCG TCAACGCCGG GCTGCTGACC
CTGACCCAGG CCATCGGGGT GATCTACGGC GCCAGCCTGG GGACCACCCT GACCAGCTGG
ATCGTGGCCC TGATCGGCTT CAACGTCGAC CTCCAGGCGC TGGCGCTGCC CGCCGTCGGC
ATCGGCATGG GGCTGCGGGT GGTTGCTGGC TCCCGGCGTC GGCAGGCGCT CGGGGATGCC
ATCGCCGGGC TCGGGCTGTT CTTCCTGGGG CTGGATATCC TGCGCGGGAC GTTTGCCGAC
CTCGGTGACC CGGGCATGCT CGCCGTCCTG GCCGGCTACG GGGTGCTGAG CCTGCTGCTG
TTCGTCGCCA TCGGCATGGT GCTGACCACG CTCACCCAGT CTTCCTCGGC GGCGCTGGCC
ATCACCCTGA CCGCTGCTGC CGGCGGGCTG GTCCCCGAGC AGGCGGCGGC GGCCACGGTG
ATCGGAGCAG CGGTGGGGAC TACCTCGACG GCGGTCTTCG CCACGTTGGG GGCCACCGCC
AATGCCCAGC GCACCGCCTC GGCCCAGGTG ATCTTCAACA CGGTGGCCGG GATGGCGGCG
TTGGCGGCCC TGGTCCCGCT GCTGGAACTG GCCCACCACA TCGTCGGCTG GCTGGGTCTG
GCGGCGCACA AGGCGGTGGT GCTGGCCGTT TTTCACACGC TGATGATGCT CCTCGGCCTG
GCCCTGATGT GGCCGGCCAC ACCCCGCCTG GTGGCGTGGC TGGAGCGGCG CTTTCGCCGG
GGCGATGACG GCCACAGCCG GCCGCGCTAC CTGGATGACA ACGTCCTCGC CACCCCGGCC
CTGGCCCTGG ATGCCGCGCG CCTGGAGGCC GAGCGCACCG GGGCCATAGC GCGCGGTATG
GTCACCGCGG CGATCAGTGG CGGCGGGCCG GCCCAGCGCG AGTGGCTCGA GGGCGAGTAC
CGGGTCCTGG AGCAGTTGAC CCTGACCATT AACGAGTTCG CCAACCGCAT CGAGCGCAGC
CAGAGTGATC CGGCCTTCGC CAACAGCCTG GCCCACCTGC TGCGGGTGAC GCAGTACCAG
CAGGACATGG CCGAGCGGGC TGTGGCCCTG GCGAAGCTGG ACGCGGGGGG TGAGACGCGC
ATCGACGATC CGGAGTTGGC CGCCGCAGTG GACCGCCTGC TCGCCGAGGC AGTTCGCGGC
ATCGAGGCCA CCGGTGAGGA CCTCTCCGCC TGGGATCGGC AGGCAGTGAA GGCCGTGCGG
AAGGGCTTCG ACCGCCAGTA TCAGCCGATC AAGGAGCGCC TGCTGCGTGC CGGCGCCGAG
GGGCGCCTGC CGGTCCGGCG GATGGTGGCG GTGCTCGATC GCCTCTCGGC GGTGCGCCGG
GCGCTGGATC AGGCCACCAA GTCCGCCCGT TATCTGCGCA AGTTCCAGAA CGCCGAACGC
GAACTGGTCG ACAGCGGGGG TGCGGACGAG GTGGGACTCC CCTGA
 
Protein sequence
MGGMQVFTFL GGIGLFLLGM RLMSDGLRAA AGGALRDILA ASTRSRLRGL FSGVLITAAV 
QSSSAVIFAT VGFVNAGLLT LTQAIGVIYG ASLGTTLTSW IVALIGFNVD LQALALPAVG
IGMGLRVVAG SRRRQALGDA IAGLGLFFLG LDILRGTFAD LGDPGMLAVL AGYGVLSLLL
FVAIGMVLTT LTQSSSAALA ITLTAAAGGL VPEQAAAATV IGAAVGTTST AVFATLGATA
NAQRTASAQV IFNTVAGMAA LAALVPLLEL AHHIVGWLGL AAHKAVVLAV FHTLMMLLGL
ALMWPATPRL VAWLERRFRR GDDGHSRPRY LDDNVLATPA LALDAARLEA ERTGAIARGM
VTAAISGGGP AQREWLEGEY RVLEQLTLTI NEFANRIERS QSDPAFANSL AHLLRVTQYQ
QDMAERAVAL AKLDAGGETR IDDPELAAAV DRLLAEAVRG IEATGEDLSA WDRQAVKAVR
KGFDRQYQPI KERLLRAGAE GRLPVRRMVA VLDRLSAVRR ALDQATKSAR YLRKFQNAER
ELVDSGGADE VGLP