Gene Ava_4540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4540 
Symbol 
ID3680144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5691895 
End bp5693217 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content42% 
IMG OID637719896 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_325033 
Protein GI75910737 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACACG TTTCCAGAAG AAAATTTCTT TTCACCACAG GTGCGGCGGC GGCGGCTTCT 
ATTTTGGCTC ATGGTTGCAC TTCCAATGGT TCTCAATCAG CTACCACCGG AGAACAAGCA
CCTTCAGCAG CACCAGCCGC TAACGTCTCA GCCGCTAACG CACCCAAGGT AGAAACAACC
AAAGCCAAGC TAGGATTCAT CCCGCTTACT GATGCTGCAC CCCTCATCAT TGCTAAAGAG
AAAGGCTTCT TTGCTAAATA TGGCATGACC GATGTTGAAG TCATCAAGCA AAAATCTTGG
CCTGTCACCC GCGATAACTT AAAAATTGGC TCATCTGGTG GTGGTATTGA TGGCGCACAT
ATCCTTAGTC CCATGCCTTA CCTCATGACC ATCAAAGATA AAGTGCCAAT GTACATTTTG
GCTAGATTAA ATACTAATGG CCAGGCTATT TCTGTAGCTG AAAAATATAA AGACCTGAAC
ATTAATTTAG AAAGCAAAAA CCTTAAAGAC GTAGCCGCCA AAGCCAAAGC TGATAAGAAA
GCCTGGAAAG CTGGTATTAC CTTTCCTGGT GGGACACATG ATTTATGGAT GCGCTATTGG
TTAGCGGCTG GTGGTATTAA TCCTGATCAA GATGTTGTGT TGGAACCTGT TCCACCGCCA
CAAATGGTAG CAAACATGAA AGTTGGGACT GTCGATTCCT TCTGTGTTGG AGAACCTTGG
AATGCTCAGT TAGTCAACCA AAAATTAGGT TATTCCGCTT TAGTTACAGG CGAATTATGG
AAAGATCATC CAGAAAAAGC CTTTACCATG CGGCAAGATT GGGTTGATCA AAATCCCAAT
GCAGCCCAGG CAATTTTGAT GGCAATTTTA GAAGCACAAC AATGGTGCGA CAAGGCAGAA
AACAAAGAAG AAATGTGTAA AATCTGCGCT GACCGTAAAT ACTTAAATGT TGCTGCCGCA
GATATTGTAG AAAGAGCTAA AGGCAATATC GATTATGGTG ATGGTCGTAA GGAAGAAAAC
TTTGCTTATC GGATGAAATT CTGGGCAGAT AATGCTTCCT ATCCCTATAA GAGTCACGAT
ATTTGGTTTT TAACTGAAGA TATTCGCTGG GGTTATTTAC CAAAAGATAC TAAAGTTCAA
GACATTGTTA ACCAAGTTAA TAAAGAAGAC TTGTGGAAGA AAGCAGCGAA AGCAATTGGT
GTGGCTGATG CGGAAATTCC TGCTAGCAGT TCCCGTGGGG TGGAAACTTT CTTTGATGGC
GTGAAATTTG ACCCAGAAAA GCCAGAAGAA TACTTAAATA GTTTGAAAAT CAAAAAAGTT
TAA
 
Protein sequence
MTHVSRRKFL FTTGAAAAAS ILAHGCTSNG SQSATTGEQA PSAAPAANVS AANAPKVETT 
KAKLGFIPLT DAAPLIIAKE KGFFAKYGMT DVEVIKQKSW PVTRDNLKIG SSGGGIDGAH
ILSPMPYLMT IKDKVPMYIL ARLNTNGQAI SVAEKYKDLN INLESKNLKD VAAKAKADKK
AWKAGITFPG GTHDLWMRYW LAAGGINPDQ DVVLEPVPPP QMVANMKVGT VDSFCVGEPW
NAQLVNQKLG YSALVTGELW KDHPEKAFTM RQDWVDQNPN AAQAILMAIL EAQQWCDKAE
NKEEMCKICA DRKYLNVAAA DIVERAKGNI DYGDGRKEEN FAYRMKFWAD NASYPYKSHD
IWFLTEDIRW GYLPKDTKVQ DIVNQVNKED LWKKAAKAIG VADAEIPASS SRGVETFFDG
VKFDPEKPEE YLNSLKIKKV