Gene Ava_4554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4554 
Symbol 
ID3680119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5706227 
End bp5707870 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content37% 
IMG OID637719910 
ProductTPR repeat-containing protein 
Protein accessionYP_325047 
Protein GI75910751 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.196541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTGGA TTACGCTACT GCGATCGCTA CAGTCTGATT TTATTAAAAG GTTATCATCT 
GGTTGTCTGC TGCATTGCGA AATCGAAGGT CAATATAGTG AGTTAACAGT CATCTCTGGC
GAAAGATTAA AAACCCTACG AGATTTTTGC TGGCTGATGG CTGAAAAATA CAAGCGGGTT
TCGCCAGTTC GTGATGTTTT TATTAGCTAC CTCAAGGGGA AATTAGGTGA GGAAGTTGTT
AAAGAACGTT TAGCTGATTT GATTACCGAA GTAGATTATG AGAAGCGGCT TGGTGGCGAT
GGCAAGATAG ATTTTACTTT AACTGCTAAC CCTGCAATTG GCATTGAAGT TAAATCTCGT
CATGGCAACA TTGATAGAGT GAGATGGTCA GTTAGTGCCG AAGAAGTGGA AAAAAATGCA
GTTGTAGTTT GCATTTTTAT TAAAGAAGAT GTTAATGAAG CACAATCATC ATATCATCTG
TTATTAGCTG GCTTTTTACC TACTCAAATG ATTAAATTAA AGACAGGTAA TATCTCATTT
GGAATAGAGC AATTACTTTA TGGTGGCGGC TTATGGGGTT ATTTGGAACA GTTGCAAGCT
TCCAGCAACT ATCATCAGTT CCAGCAATCT CCGCCAATTT ATGAATATCA ACCCCAGCCA
GAATTTTCAA CTAAAATCAA TCAAAGTCAA TCAATCAAAC CAGCTTTATT TACTGGTATC
AAAAATATTC TATCTTATAG ACGAGAAGAA GATATAAATA TAGATTATAT AAAACTTGGT
GATGAGTGTT TTGCTCAAGG TGAATATACT GCATCTATTA AGAATTATAG CCAAGCTTTA
CAAGCAAGTA GTAATAATGG TGAATTATAT TATAAACGAG GTTTAACTTA TTATCAATTG
GGAGATTATG AGGCGGCGAT CGCTGATTAT TCTCAAGCCA TAAATCTCAA CTTTCACGAT
GCTAAATCCT ATCATAAACG TGGCTTGGCT TTATCACAAC TAGCAGCTTA TGAAGCGGCA
ATTGACGATT ATAACCAAGC AATTAGAATT AATCCTCATG CTGCTTCTAT TTATAAAAAC
CGAGCAGAAG CACGCTCTCA TTTAGGAGAT AATCAAGGAG CGATTGAAGA TTATACCCAA
GCGATCAAGA TTAATCCCCA ATATGCAGAT ACATATAAAA ATAGAGGCAT ATCTCGTTAT
TTATTAGCAA CACAACCAGG ATTTACCCAA GCAATTAAGA TTAATCCCAA TGATGCTAAT
GCTTACAAAA ATCGTGGTAA TGCGCGTGCT GATATTGGTG ATTATGCAGG AGCGATTGAA
GATTATAATC AGGCAATCCA AATTAATCCC AAGGCGGCTG ATGCTTATTA TAACCGTGGT
AACGCCCGTT ATGATTTAGG GGATGAAGAA GGAGCGATCG CTGATTACAC CCAAGCAATC
CAAATTAATC CCAGCTATGC TGATGCTTAT TATAACCGTG GTAATGTGCG TGCAGGCATA
AAAGATAAAC AAGGCGCGAT CGCTGACTTT CAAAAAGCAG CAGATATATA TCGTAAAGAA
GGTAAATTAG CAGAACTCAA AGATGCAACA GAAAGAATTG TAGAATTGGA AATAGAAGAA
TCCATTGATA TTTTAAATTT TTAA
 
Protein sequence
MDWITLLRSL QSDFIKRLSS GCLLHCEIEG QYSELTVISG ERLKTLRDFC WLMAEKYKRV 
SPVRDVFISY LKGKLGEEVV KERLADLITE VDYEKRLGGD GKIDFTLTAN PAIGIEVKSR
HGNIDRVRWS VSAEEVEKNA VVVCIFIKED VNEAQSSYHL LLAGFLPTQM IKLKTGNISF
GIEQLLYGGG LWGYLEQLQA SSNYHQFQQS PPIYEYQPQP EFSTKINQSQ SIKPALFTGI
KNILSYRREE DINIDYIKLG DECFAQGEYT ASIKNYSQAL QASSNNGELY YKRGLTYYQL
GDYEAAIADY SQAINLNFHD AKSYHKRGLA LSQLAAYEAA IDDYNQAIRI NPHAASIYKN
RAEARSHLGD NQGAIEDYTQ AIKINPQYAD TYKNRGISRY LLATQPGFTQ AIKINPNDAN
AYKNRGNARA DIGDYAGAIE DYNQAIQINP KAADAYYNRG NARYDLGDEE GAIADYTQAI
QINPSYADAY YNRGNVRAGI KDKQGAIADF QKAADIYRKE GKLAELKDAT ERIVELEIEE
SIDILNF