Gene Aave_3046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAave_3046 
Symbol 
ID4668107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax citrulli AAC00-1 
KingdomBacteria 
Replicon accessionNC_008752 
Strand
Start bp3355748 
End bp3356773 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content66% 
IMG OID639824248 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_971387 
Protein GI120611709 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.398245 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0441493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTTC GCCGCCACTT TATCAAGTTT CCCGCCGCCG CCGCCCTCGC AGGTTCCGCC 
GGCGTCCTGG CCACGCCGGC CATCGCCCAG CAGCCGGTGC AGCTGCTCAA CGTGTCGTAC
GACCCCACGC GGGAGCTGTA TGTGGACTAC AACCAGGCGT TCGCGAAGTA CTGGAAGGGC
AAGACCGGGC AGGATGTGCA GGTCAGGCAG TCCCACGGCG GCTCGGGCAA GCAGGCCCGC
TCGATCATCG ACGGCATCGA TGCCGACGTG GCCACGCTGG CCCTGGGCGG CGACATCGAT
GCGCTGGTGA AGAACGGCGG CCTGGTCAGG CCCGACTGGC AGAAGCGCCT GCCGCACAAC
TCCGCCCCGT ACACCTCCAC GATCGTGTTC CTCGTCAAGA AGGGCAATCC CAAGGGCATC
AAGGACTGGG ACGACCTGGT GAAGCCCGGC GTGCAGGTGA TCACCCCCAA TCCCAAGACC
TCCGGCGGCG CGCGCTGGAA CTACCTGGCC GCCTGGGAAT ACGCCAAGCG CAAGTACGGC
GGCGATGCGC AGGCCAGGGA ATTCGTCGGC AAGCTCTACA AGAACGTGCC GGTGCTCGAT
ACCGGCGCCC GCGGCTCCAC GATCACCTTC GTGCAGCGCG GCGTGGGCGA CGTGCTCCTG
GCCTGGGAGA ACGAAGCCTT CCTGGCCCTG AAGGAGTTCG GCGCGGAGAA GTTCCAGATC
GTCGCCCCCT CGCTTTCCAT CCTGGCCGAG CCGAGCGTCG CCGTGGTGGA CAAGGTGGTG
GACAAGAAGG GCACGCGCGC CGTGGCCGAG GAATACCTCA AGTACCTGTA TTCGGACGAA
GCCCAGGACA TCGCCGGCAG GCATTTCTAC CGGCCGACCG GCGAGAAGGC GAAGGCCAAG
TACGACGCGC AGTTCCCCAA GCTCACGCTC GCCACCATCG ACCAGGCATT CGGCGGCTGG
GGCAAGGCCG ACAAGGACCA CTTCGCCGAC GGCGCGAGCT TCGATCAGAT CTACACGAAG
AAGTGA
 
Protein sequence
MNLRRHFIKF PAAAALAGSA GVLATPAIAQ QPVQLLNVSY DPTRELYVDY NQAFAKYWKG 
KTGQDVQVRQ SHGGSGKQAR SIIDGIDADV ATLALGGDID ALVKNGGLVR PDWQKRLPHN
SAPYTSTIVF LVKKGNPKGI KDWDDLVKPG VQVITPNPKT SGGARWNYLA AWEYAKRKYG
GDAQAREFVG KLYKNVPVLD TGARGSTITF VQRGVGDVLL AWENEAFLAL KEFGAEKFQI
VAPSLSILAE PSVAVVDKVV DKKGTRAVAE EYLKYLYSDE AQDIAGRHFY RPTGEKAKAK
YDAQFPKLTL ATIDQAFGGW GKADKDHFAD GASFDQIYTK K