Gene Ava_4146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4146 
Symbol 
ID3681085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5170224 
End bp5171264 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content44% 
IMG OID637719492 
ProductABC transporter, substrate-binding protein, aliphatic sulphonates 
Protein accessionYP_324640 
Protein GI75910344 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.133016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAC GCCGTCGTTT TCTAAATGTC GCTACATCTA GTCTGGGCGG TTTTTCTATG 
GCCTATTTGT TGGGAAGTTG TAGCCAACAA AGCCAAAAGA ATGTAGCAAA TAGTAGCAGT
TCTCTGGCGA TAAAAACGAA GGTTCTCCGC ATGGGGTATC AAAGTGCAGG GGATCTTGTC
CGTAATCGGC AAGTTTTAGA AAAACGCTTA GATCCTCTGG GAATTAAAGT AGAATGGCTG
CAATTTGCCC AAGGGCCGCA GCTGATGGAA GGGATGGCTG CCCGTAGGGT TGATATAGGC
TCAGTGGGAG AAACCCCACC AATTTTTGCC CAAGTGGCTG GATCAGACAT TGTTTATGTC
GTGGGTACAC AAAGAAACGC AGGAACCGGC AGAAGTAGCG TGATTGCAGT GCCGCCAGAG
TCTCCACTCA CCAAGTTTGA GGAGATTAAG GGGCAAGAAG TTTATTTTCA AAAAGGCTCG
GCATCACATT ATTTTATGCT CAGGGCTTTA CAGTCGATTG GTTTGACCAT CAAAGATATC
AAAATCAAAA GTATGGCAAC TATCGAGGCG CGGGCTGCTT TTCTGGAGGG AAAAATCCCT
GTTTGGATGA CAGGAGATCC CCACTATGCG ATCGCTGAAA AAATGAATCG CATTCGTGTT
CTCAGAGATT CTGTTGGCTT GGATTCTCCT GGCGGCTACT ACATTGCTGA TAGAAAATTT
GCTCAAGAAA ATCCTGGTGT TCTGAAAATT CTGATTGAGG AACTTCATGC GCTTGATAAA
TGGGCAGAAG TGAATCGAGA TGAAGTCAAA AAATTGATGA TTACCCAACA AAAGCTGGAT
GAAGATGTAG CCGAAAGGGT AATGTCTCGT CGCACTTTTG CCGGACGCAG AGGTTTAAGT
CCGGCACTAA TAGCGGAACA ACAGCGTGTA GCAGATTTAT TCTTTAAAGA AGGTGTAATC
CCCAAGAAAA TCAACATCAG TGATGCCTTA CTCCCATCTG ATTTGTATGC TGCAATCACA
CCGCCAGAAA TTATGGTTTA A
 
Protein sequence
MIKRRRFLNV ATSSLGGFSM AYLLGSCSQQ SQKNVANSSS SLAIKTKVLR MGYQSAGDLV 
RNRQVLEKRL DPLGIKVEWL QFAQGPQLME GMAARRVDIG SVGETPPIFA QVAGSDIVYV
VGTQRNAGTG RSSVIAVPPE SPLTKFEEIK GQEVYFQKGS ASHYFMLRAL QSIGLTIKDI
KIKSMATIEA RAAFLEGKIP VWMTGDPHYA IAEKMNRIRV LRDSVGLDSP GGYYIADRKF
AQENPGVLKI LIEELHALDK WAEVNRDEVK KLMITQQKLD EDVAERVMSR RTFAGRRGLS
PALIAEQQRV ADLFFKEGVI PKKINISDAL LPSDLYAAIT PPEIMV