Gene Ava_4163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4163 
Symbol 
ID3681103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5211297 
End bp5212415 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content45% 
IMG OID637719510 
Productbinding-protein dependent transport system inner membrane protein 
Protein accessionYP_324657 
Protein GI75910361 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0546516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000167593 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATTGGT GGCAAAGACT TCAGAAAAAT CCTTTAGCGC AATTTGGGGC TATTTTACTT 
TTAATATTTT ATTTGGCGGT GATTGCGGCT GATTTTATCG CTCCATACGA CCCTTACACT
TCTCAACCGA ATGGTTCGCT ATTACCGCCA ACTAAGATTT ATTGGGTTTC AAAAACATCA
GGTAAGTTTA TCGGCCCCCA CGTTTATCCC ACAACACAAG GTAATACAGA CTTAGAAACA
GGCGATCGCC AACTCATTGT AGACGATAAA AAGCCCTCAC CTGTGCGTTT CTTTGTCTCT
GGGCCAGAAT ACCGACTGTT ACAGCTAAGT TTACCCCTAC CCCCCAAGTG GGAAGAAACC
ACAATTATCC CCGGTATCCC CTTAAATTGG CATTTATTCG GTGCAGATAA TGGGGCAAAA
CTCAACATCT TAGGTACGGA CGAACAAGGC CGCGACCAAT TTAGCCGCCT CCTACATGGT
GGACGCATTA GTATGTTTAT CGGCATTATT GGGGTGGTAA TTACTTTTCC CCTCGGTTTG
CTAATAGGGG GAATTTCCGG CTATTTCGGT GGTTGGACGG ACAGCATTAT TATGCGGATT
GCAGAAGTGC TGATGACTTT CCCCAGTATT TATCTGTTAG TTACCTTGGG GGCAGTTTTA
CCGGCTGGTT TAACTAGCAG TCAGCGATTT TTACTCATAG TTTTGATCAC CTCTGTAATT
AGCTGGGCTG GGTTAGCCAG GGTAATTCGT GGACAAGTGC TGTCAATCAA AGAACGAGAA
TTTGTCCAAG CCGCCAGGGC TATGGGTGGT AAGCCAATAT ATATTATTCT GCGTCATGTT
CTGCCGCAAA CTGCTACTTA TGTAATTATC TCTGCTACTT TGGCGGTTCC TAGCTTTATC
GGTTCAGAAG CAATACTCAG TCTCATCGGT TTAGGCATCC AACAACCAGA CCCATCTTGG
GGTAATATGC TATCTCTAGC TAGCAATGCT TCCATATTAG TGCTGCAACC TTGGTTAATT
TGGCCGCCAG CCGTGTTAAT TATTTTGACA GTTTTAGCTT TTAATTTACT CGGTGATGGC
CTTAGGGATG CCCTTGATCC TCGGAGTTTA CGCCGCTAG
 
Protein sequence
MNWWQRLQKN PLAQFGAILL LIFYLAVIAA DFIAPYDPYT SQPNGSLLPP TKIYWVSKTS 
GKFIGPHVYP TTQGNTDLET GDRQLIVDDK KPSPVRFFVS GPEYRLLQLS LPLPPKWEET
TIIPGIPLNW HLFGADNGAK LNILGTDEQG RDQFSRLLHG GRISMFIGII GVVITFPLGL
LIGGISGYFG GWTDSIIMRI AEVLMTFPSI YLLVTLGAVL PAGLTSSQRF LLIVLITSVI
SWAGLARVIR GQVLSIKERE FVQAARAMGG KPIYIILRHV LPQTATYVII SATLAVPSFI
GSEAILSLIG LGIQQPDPSW GNMLSLASNA SILVLQPWLI WPPAVLIILT VLAFNLLGDG
LRDALDPRSL RR