Gene Huta_0201 details


Gene Information

Locus tag: Huta_0201
Symbol:
ID: 8382463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 196859
End bp: 198337
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 66%
IMG OID: 644971259
Product: Na+/solute symporter
Protein accession: YP_003129122
Protein GI: 257051289
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
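
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
start_bp = 196859
end_bp = 198337

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1479 492
```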


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0231312
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAACG TCGCCCTGCA ACTCGGGATC GTCGTCGGCT ACCTCCTGCT CGCTCTCGGC 
GTCGGGTTGG TCGCGTATCG GCTGACCGAG CGCTCGGCGG AGGACTACTA CCTGGCGAAT
CGGACGATCG GAACGGTCGT CCTCCTCTTT ACGACCTTCG CGACGCTGCT GTCGGCGTTT
ACGTTCTTCG GCGGCCCCAA CCTCGCGTTT GCGGCGGGGC CGGAGTGGAT TCTCGTCATG
GGGTTGATGG ACGGCGTCAT CTTCGGAATC CTGTGGTACG TGATGGGCTA CAAGCAGTGG
CTCATCGGTC GCGAATACGG CTACGTCACG GTCGGGGAGA TGCTGGGCGA CCGCTTTGAC
TCCCGGGCTC TGCGTGGACT CGTCGCCGGG ATCAGCCTCT TCTGGCTACT GGAGTACGTC
ATGCTCCAGC AGGTCGGGGC CGGCCGGGCG CTGGAATCGC TCACGGAAGG GGCGATCCCC
TACTGGGCGG GGGCGACGTT GATCACCGCA TTTATGATCG GGTACGTCGT CCTCGCGGGA
ATGCGCGGCG TCGCCTGGAC GGACACGTTA CAGGGCGTGT TCATGCTCGT CCTCGTGTGG
GTTGCCCTCG CCTGGATCGC CGTCTCGGCG GGTGGCCCCG GCGAGTTGAC GGCGGCGATG
GGCGAGGTAA ACCCCGAATT CCTCTCGCTT GGCGGCGGGC TGTACTCGCC CCAATACATG
CTCTCGATGG CGATCGCGAT CGGCTTCGGC GTCGCGATGT TTCCCCAGAT CAACCAGCGC
TTTTTCGTCG CCCGGAGCGA GCGCGTCCTC AAACGGTCGC TGGCGCTGTG GCCCCTGCTG
GTCATCCTGC TTTTCGTCCC CGCGTTCATC CTCGGGGCGT GGGCGACGGG ACTCGGTGTC
ACGCCCAACG CCCGGGGGAA CATTCTCCCG CCGTTGTTGA ACGCCTACAC GCCGACGTGG
TTCGCCGCCC TGGTCGTCGC GGGCGCGATG GCCGCGATGA TGTCCTCAAG CGACTCGATG
TTGCTGTCGG GGTCGTCGTA CCTGACGCGG GACCTCTATC GCCCGTTCGT CGATCCGGAC
GCCAGCGACC GCCGGGAGGA CCTCGTCGCC AGGTTGGCAG TCGTCGCCTT CGCGCTGATC
GCACTGGCGA TGAGCCTCGG GACGAACCTG ACGCTGATCG AGATCGGCGC GACGGCGTTC
AAGGGCTACG CCCAGCTGAC TCTGCCCGTG CTGGTCGCGC TGTACTGGCG CGGGACGACG
CGTGCGGGGA TGCTCGCGGG CGTCGGTATC AGCCAGGCGT TCTACCTGCT TGCGACGTTC
ACCGACCCCG TCCCGGCGAC CTACGGGGGT TGGCAGGCCG GGCTGATCGG GATGGGGATC
GGTCTCGTGG TGACGGTCGG CGTCTCGCTG GTCACCCGTG CCGCACCCGA GGAGAACGCT
GATCGGTTCG TCACGCCAGA GAGTACCGAC GCTGACTAG
 
Protein sequence
MANVALQLGI VVGYLLLALG VGLVAYRLTE RSAEDYYLAN RTIGTVVLLF TTFATLLSAF 
TFFGGPNLAF AAGPEWILVM GLMDGVIFGI LWYVMGYKQW LIGREYGYVT VGEMLGDRFD
SRALRGLVAG ISLFWLLEYV MLQQVGAGRA LESLTEGAIP YWAGATLITA FMIGYVVLAG
MRGVAWTDTL QGVFMLVLVW VALAWIAVSA GGPGELTAAM GEVNPEFLSL GGGLYSPQYM
LSMAIAIGFG VAMFPQINQR FFVARSERVL KRSLALWPLL VILLFVPAFI LGAWATGLGV
TPNARGNILP PLLNAYTPTW FAALVVAGAM AAMMSSSDSM LLSGSSYLTR DLYRPFVDPD
ASDRREDLVA RLAVVAFALI ALAMSLGTNL TLIEIGATAF KGYAQLTLPV LVALYWRGTT
RAGMLAGVGI SQAFYLLATF TDPVPATYGG WQAGLIGMGI GLVVTVGVSL VTRAAPEENA
DRFVTPESTD AD
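
The sequences above are displayed in space-separated blocks of ten letters. The following is a rough sketch of how such a block could be cleaned, translated, and checked for GC fraction in plain Python. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table uses the standard amino-acid assignments, which translation table 11 shares. The sketch does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
# Codon table laid out in NCBI order (bases T, C, A, G); the amino-acid
# assignments are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-letter display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (input must be non-empty)."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as displayed in the record.
first_row = "ATGGCGAACG TCGCCCTGCA ACTCGGGATC GTCGTCGGCT ACCTCCTGCT CGCTCTCGGC"
print(translate(first_row))  # MANVALQLGIVVGYLLLALG
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein length and GC content to be checked the same way.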