Gene Avin_05900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_05900 
Symbol 
ID7759546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp569243 
End bp570388 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content60% 
IMG OID643803510 
Producttransposase IS891/IS1136/IS1341 
Protein accessionYP_002797818 
Protein GI226942745 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.542736 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAC GCGCCTACAA GTACCGCTTC TACCCGACTC CAGAACAAGC AGAACTGCTT 
GCCAGGACGT TCGGTTGCGT GCGTTCCGTC TACAATCGCA TCCTGCGCTG GCGTACCGAT
GCCTTCTACC AGGAGCAGAA GAAGATCGGC TATACGGCGG CCAGCAGTCG CCTGACCGCG
CTCAAGAAGC AACCGGAGCT GGCTTTTCTC AATGAAGTCA GTGCGGTGCC ATTGCAGCAG
TGCCTTCGCC ACCAGCAGGC TGCGTTCAAG AACTTTTTCG AGGGTCGAGC GAAGTACCCG
GTCTTCAAGA AGAAACGGCA CCGGCAGTCC GCCGAGTTCA CCAGCTCGGC CTTCCGCTAC
CGAGACGGCA AGCTGTTCCT GGCCAAGTGC GACGAGCCCC TGGCGATCCG CTGGAGTCGG
CCACTTCCTG GTGAGCCTTC CACGGTCACG ATTTCCCGGG ACTCTGCAGG GCGGTACTTC
GTCTCCTGCC TGTGCGAGTT CGAACCCGAG GCACTGCCCG TCACGCCGAA GACGATCGGC
ATCGACATGG GCATCAAAGA CCTGTTCGTC ACCAGCGAGG GCGAACGGAT CGGCAATCCC
CGCCATACGG CCAAATACGC CACCCGTCTG GCTAGGGCAC AGCGTCGACT GAGCAAGAAG
AAACTCGGCT CGGAGAACCG CGCCAAGGCC CGACTGAAAG TGGCCCGTAT TCACGCCAAA
ATTTCCGATT GCCGAGCGGA CAGCTTGCAC AAGCTGTCCC GCAGACTGAT TAACGAGAAC
CAAGTGGTCT GCGCTGAAAC CCTTGCCGTG AAGAATATGA TCCGCAATCC GAAACTGAGC
AAAGCCATTG CCGATGCGGG ATGGGGCGAA TTGACGCGCC AGATCCAGTA CAAAGGTGAA
TGGGCCGGTC GGCAGATCGT CCAGATCGAC CGCTGGTATC CCTGCTCGAA ACGCTGTGCC
TGCTGCGGGC ATATCCTTGA GCGCCTGCCG CTGGATGTTC GCCGCTGGAG TTGCCCGGAA
TGCGGAACCG AGCATGACCG CGACGTGAAC GCCGCGATCA ACATTAAAGC CGCCGGGCTG
GCGGTGTTAG CCCTTGGAGA GAACGTAAGC GGCATGGGTC AAGTATCCGT GTCCTGTTCT
CAGTGA
 
Protein sequence
MTKRAYKYRF YPTPEQAELL ARTFGCVRSV YNRILRWRTD AFYQEQKKIG YTAASSRLTA 
LKKQPELAFL NEVSAVPLQQ CLRHQQAAFK NFFEGRAKYP VFKKKRHRQS AEFTSSAFRY
RDGKLFLAKC DEPLAIRWSR PLPGEPSTVT ISRDSAGRYF VSCLCEFEPE ALPVTPKTIG
IDMGIKDLFV TSEGERIGNP RHTAKYATRL ARAQRRLSKK KLGSENRAKA RLKVARIHAK
ISDCRADSLH KLSRRLINEN QVVCAETLAV KNMIRNPKLS KAIADAGWGE LTRQIQYKGE
WAGRQIVQID RWYPCSKRCA CCGHILERLP LDVRRWSCPE CGTEHDRDVN AAINIKAAGL
AVLALGENVS GMGQVSVSCS Q