Gene Hoch_1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1944 
Symbol 
ID8544326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2672798 
End bp2674357 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content68% 
IMG OID646386648 
ProductAbgT putative transporter 
Protein accessionYP_003266383 
Protein GI262195174 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2978] Putative p-aminobenzoyl-glutamate transporter 
TIGRFAM ID[TIGR00819] p-Aminobenzoyl-glutamate transporter family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.432903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00626673 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCCGACT CCCCATCGAC GCCGCGCTCG AATGAGACGC CGCGTCTCGC CGACAAAGTC 
CTCTCGTGGA TCGAGGCCGC CGGCAACAAG CTGCCCGATC CCGCCGTCAT CTTCGTCATC
GCCATGCTGC TGGTGTGGAT CGCCTCGGCG CTGCTCGACG GCCACGCCTT CGAGGTGCCG
GCCAAGGACG GCATCAAGGC CAACCAGATC GTCAGCCAGC TCGGCGCCGA AGCGCTCACC
ACCTTCTTCG CCGAGATGGT GCACACCTTC ACCAGCTTCC ACCCGCTCGG CGTGGTCCTC
GTCGCCCTGC TCGGCGTCGG CGTGGCGGAA TCCTCGGGCT TCATCAACGC CTGCCTCAAG
GGCCTGCTCA GCTTCACGCC CAAGGCGCTG CTCACGCCCA TGGTCATCTT CGTCGGCATC
CTCAGCCACA CCGCGGCCGA CGCCGGTTAC GTGCTGGTCA TCCCGCTCGG CGGCGTCATC
TTCTACGCCG CCGGCCGCCA TCCATTGGCC GGCATCGCGG CCGCCTTCGC CGGCGTCTCG
GGCGGCTTCA GCGCCAACCC CATCCCCTCG GCCATCGACC CGCTGCTGCA GGGCCTCACC
CAGGAGGCCG CCGGCATCCT CGACCCCGCG CGCGAGGTCA ACCCGCTGTG CAACTGGTGG
TTCATGGCCG CCTCGAGCAT CCTCATCGTC GGCCTCGGCT GGCTGCTCAC CGACAAGGTC
GTCGAGCCGC GCCTCCGCGC CAACGCCGAG ATCGACGGCG ACCCCGAGGA CATGCCGACC
ATGGAGTCGA TGACCTCGGA CGAGCGCCGC GGCATGTGGG CCGGGCTCGT CGCCATGTTC
GCCGGCGTCG CCACCCTGGT GCTGCTGTGC CTGCCCAGCG ACTCGCCGCT GCGCGCGCCC
GACGGCGATC TGACCGCGTT CCAGGCGCCG CTGATGCAGA TGATCGTGCC GCTGATCTTC
CTGGTCTTCC TGGTCCCCGG CATCGTCCAC GGCTACGTGT CCAAGACCTT CGAGAGCCAC
CGCGACATCA TCAAGGGCAT GAGCAAGACC ATGAGCACCA TGGGCTACTA CATGGTCATG
GCCTTCTTCG CCTCGCTGTT CATCTACTCG TTCGGCAAGT CGCAGATCGG CGCGCTGCTG
GCGGTCGAGG GCGCCAACAT CCTGCGCAGC CTGGCCCTGC CCGGCGAGGT GACGCTGTTC
GGCATCGTCC TCTTGTGCGC GGCCGTCAAC CTGCTCATCG GCTCGGCCTC GGCCAAGTGG
GCGCTGCTGG CGCCGATCTT CGTGCCCATG CTCATGCAGC TCGGCATCTC GCCCGAGCTC
ACCCAGGCCG CGTACCGCGT CGGCGACTCG ACCACCAACA TCATCACGCC GCTGATGCCG
TACTTCCCGC TGGTGGTCGC GTTCTCGCAG CGCTACGTCA AGAAGACCGG TATCGGCACG
CTCACGGCCG CGATGCTGCC GTACTCGATC ACCTTCCTGA TCGCCTGGAG CGTGTTCCTG
GTGGCGTTCT GGCTGCTCGG CATCCCGCTC GGCATCCAGG GCGTGTACGC CTACCCCTGA
 
Protein sequence
MSDSPSTPRS NETPRLADKV LSWIEAAGNK LPDPAVIFVI AMLLVWIASA LLDGHAFEVP 
AKDGIKANQI VSQLGAEALT TFFAEMVHTF TSFHPLGVVL VALLGVGVAE SSGFINACLK
GLLSFTPKAL LTPMVIFVGI LSHTAADAGY VLVIPLGGVI FYAAGRHPLA GIAAAFAGVS
GGFSANPIPS AIDPLLQGLT QEAAGILDPA REVNPLCNWW FMAASSILIV GLGWLLTDKV
VEPRLRANAE IDGDPEDMPT MESMTSDERR GMWAGLVAMF AGVATLVLLC LPSDSPLRAP
DGDLTAFQAP LMQMIVPLIF LVFLVPGIVH GYVSKTFESH RDIIKGMSKT MSTMGYYMVM
AFFASLFIYS FGKSQIGALL AVEGANILRS LALPGEVTLF GIVLLCAAVN LLIGSASAKW
ALLAPIFVPM LMQLGISPEL TQAAYRVGDS TTNIITPLMP YFPLVVAFSQ RYVKKTGIGT
LTAAMLPYSI TFLIAWSVFL VAFWLLGIPL GIQGVYAYP