Gene Avin_18790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_18790 
Symbol 
ID7760813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1863283 
End bp1865016 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content74% 
IMG OID643804777 
Producttetratricopeptide (TPR) repeat and VWA domain-containing protein 
Protein accessionYP_002799066 
Protein GI226943993 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00254687 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATCC TCTGGCCGCA CTGGCTGCGC CCCGACTGGT TGCTCCTGCT GCCCCTGCTC 
GCCTGGCTGC TCTGGCGGCT GTGGCGCCGC GAACGGCGCG CCGGACGCTG GGAACTGCTG
CTGCCGCCGG CCTTTCACCA GGCCCTGCTC GGCGCGCGCA GCGGACGCGG CAGCCGCCTG
CCGTGGATCG CGCTCGGGCT GGGCTGGCTG CTCGCCCTGC TGGCCCTGCT CGGGCCGAGT
TGGCAACGCA TCGAACAAAG CCCCCTGAAG CGCGCCGATC CGCTGGTGGT ACTGCTCGAA
CTGACCCCGA GCATGCTCGC CGGCGACGTC GCCCCGAACC GCCTGGCGCT AGCCCGGCAC
AAGCTGCTCG ACCTGCTCGA GGCGCGCCAG GAGGCGCAGA CCGCGGTGGT CGTCTACGCC
GGCAGCGCAC ACACCCTGGT GCCGCTGTCC GACGACCTGG AAACCACCCG CAATCTGCTG
GACGCCCTCG CCCCGCCGCT CATGCCGGTT GCCGGGCGGC GCGCCGACCT CGCCGTGGCC
CGCGGCCTGG CCCTGCTCGA ACAGGGTGCC CAGGGCCGTG GCCAGTTGTT GCTGATCGGC
AGCGAACTGG ACGAACGGGA ACGCCAGGGC ATCGCCCAGG CGCTCGGCGA CGACGGCGAG
CGCCTCGCCA TCCTCGGCAT CGGCAGCCCT GACGGCGCGC CCATCGCCCT GGAGGACGGC
AGCTTCCTCA AGGACGGACA GGGCGCGATC CTCCTCGCCC GGCTGGACAG CGGCGGACTC
GCGCGCTTCG CCGACAGCCT CGGCGGGCGC TACCAGAGCG CCGGCCTGGA CGACCGCGAC
CTGCAACGGC TGGGCCTGCT CCAGGGGCCG CGACTGCTGC GCGAGGGCGA CCAGACGACA
CGCCTGGACG CCTGGGCCGA CCAGGGCCAC TGGCTGCTGC TGCCGCTGCT CCTGCTGGCC
GCCTGCGCCG GCCGGCGCGG CTGGCTGCTC GCCCTGCCCC TGCTGTTCGT CCTGCCCCAG
CCCGGCCAGG CCTTCGAGCT CACCGACCTG TGGCTGCGCC CCGACCAGCA GGGCCGCATC
CTGCTCCAGG CCGGCCGGCC GGGCGAGGCC GCGCGGCGCT TCGAAGACAG CCAGTGGCAG
GGCCTGGCCC TCTACCAGGC CGGCGACTAC GCCGCCGCCG CCGAGCGCTT CGCCCAGGGC
CAGGGGGCCG CCGCCCACTA CAACAGCGGC AACGCCCTGG CTCGGGCCGG CGAACCGGAG
GCCGCGCTGG ACGCCTACGA GCGTGCCCTG GAACTGCAGC CGGCGCTGGA AGCGGCGCAG
CACAACAAGG CGCTGGTCGA GGCGCTGCTG CGCCAGCGGC AGGCGCGTCA GCCCGACGCC
GATGGCAGTG CCGAGCGGCA GCAGGCCCCG CCGAACCGGG ACAGCCCGTC CGGCCAGGCT
GGGCAGAACG ATATCCGGGC GGACACGACG GCCTCCCAGG CCGCAGCGGA GGCAGCCCCG
GACGAAGCGA CGGCGCAGCC GGGCACGCCC GGCTCGCTAT CGGCCGAGGG ACAGACCGCG
GGCGATACGG CGCCGGGCCC CGCCGCGGCC GCCGGAGCCG TGTCCGCGCC CGTCACCGAG
GAACGCCGCC AGGCCCTGGA ACAATGGCTG CGGCAGATCC CCGACAATCC CGGAGAATTG
CTGCGGCGCA AATTCTGGTA CGAACAGCAA CAGCACCGGG AAACCAGCCC ATGA
 
Protein sequence
MNILWPHWLR PDWLLLLPLL AWLLWRLWRR ERRAGRWELL LPPAFHQALL GARSGRGSRL 
PWIALGLGWL LALLALLGPS WQRIEQSPLK RADPLVVLLE LTPSMLAGDV APNRLALARH
KLLDLLEARQ EAQTAVVVYA GSAHTLVPLS DDLETTRNLL DALAPPLMPV AGRRADLAVA
RGLALLEQGA QGRGQLLLIG SELDERERQG IAQALGDDGE RLAILGIGSP DGAPIALEDG
SFLKDGQGAI LLARLDSGGL ARFADSLGGR YQSAGLDDRD LQRLGLLQGP RLLREGDQTT
RLDAWADQGH WLLLPLLLLA ACAGRRGWLL ALPLLFVLPQ PGQAFELTDL WLRPDQQGRI
LLQAGRPGEA ARRFEDSQWQ GLALYQAGDY AAAAERFAQG QGAAAHYNSG NALARAGEPE
AALDAYERAL ELQPALEAAQ HNKALVEALL RQRQARQPDA DGSAERQQAP PNRDSPSGQA
GQNDIRADTT ASQAAAEAAP DEATAQPGTP GSLSAEGQTA GDTAPGPAAA AGAVSAPVTE
ERRQALEQWL RQIPDNPGEL LRRKFWYEQQ QHRETSP