Gene Avin_42370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_42370 
Symbol 
ID7763113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4264408 
End bp4265619 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content63% 
IMG OID643807088 
Producttransposase 
Protein accessionYP_002801336 
Protein GI226946263 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCGAC TTCAAGCCTT CAAATTCGAA GTGATGCCAA CCGGCGAACA GCAGCGCCAG 
ATGCGCCGCT TCGCTGGCTC GTGCCGGTTC GTGTTCAACA AGGCATTGGC GTGGCAAAAG
GAACGCTACG AACAGGGCGA ATCGAGGCTC GGCTATGCCG GACTGTGCAA GCGGCTCACG
GAATGGCGGC ATGATCCGGA GACGGCCTGG CTGGCGGATG CACCGGTTCA TCCGCTGCAA
CAGGCACTCA AGGACCTGGA GCGGGCCTAC GCCAATTTCT TCGCCCAGCG GGCCGACTTC
CCGCGCTTCA AGAAGAAGGG CCGGCGCGAC AGCTTCCGCT ATCCCGACCC GAAGCAGATC
AAGCTCAACC AGGAAAATAG CCGCCTGTTC CTGCCCAAGC TCGGCTGGCT GCGTTATCGC
AACAGCCGGA ACGTGTCCGG CATGGTGAAG AACGTCACCG TCAGCCAGTG TTGCGGCAAG
TGGTTCGTGT CCATCCAGAC CGAGCGCAAG ATGGCGCAAC CCATCCCGAA GGGTGGTGCG
GTCGGCATCG ACATGGGGGT GTCCCGCTTC GCCACGCTCT CGGACGGCAC GTTCTACGCA
CCGCTCAACA GCTTCAAGCA GCATGAGAAA CGACTGCGCA AGGCGCAGCG GGCGATGAGC
CGCAAGCAGA AGTTCAGCAA CAACTGGAAG AAGGCGAAAG CCCGCGTCCA GCGTATCCAT
TCCCGGATCG GCCATGCCCG CCGCGACTAC CTGCACAAGA TCTCGACCAC GATCAGCCAA
AACCACGCGA TGGTGTGTAT CGAGGACTTG CCGGTGCGGA ACCTGTCCAG GTCGGCGGCA
GGCACAACCG AAGTACCGGG CAGAAACGTT CGGGCCAAGT CCGACCTGAA CAAATCCATC
CTCGACCAGG GCTGGTTCGA GTTCCGCCGC CAACTGGACT ACAAGCTGGC GTGGAACGGC
GGCTGGCTCG TTGCCGTGCC GCCGCAGAAC ACCAGCCGCA CCTGCCCGTG CTGCGGGCAT
GTGTCGGCGG ACAACCGGCA GAGCCAGGCC CGGTTCGAGT GCGTGGAGTG TGGTTTCGAG
GAAAACGCCG ATGTGGTCGG CGCGATCAAT GTGTCAAGGG CGGGACACGC CCGGTTCGCC
TGTGAAGTGA GCGGTGTGGT AAGGCCGCCA GCAGCAGGAA CCCGCCGAGG TGAGTCGGCC
CGGGTGGGCT GA
 
Protein sequence
MLRLQAFKFE VMPTGEQQRQ MRRFAGSCRF VFNKALAWQK ERYEQGESRL GYAGLCKRLT 
EWRHDPETAW LADAPVHPLQ QALKDLERAY ANFFAQRADF PRFKKKGRRD SFRYPDPKQI
KLNQENSRLF LPKLGWLRYR NSRNVSGMVK NVTVSQCCGK WFVSIQTERK MAQPIPKGGA
VGIDMGVSRF ATLSDGTFYA PLNSFKQHEK RLRKAQRAMS RKQKFSNNWK KAKARVQRIH
SRIGHARRDY LHKISTTISQ NHAMVCIEDL PVRNLSRSAA GTTEVPGRNV RAKSDLNKSI
LDQGWFEFRR QLDYKLAWNG GWLVAVPPQN TSRTCPCCGH VSADNRQSQA RFECVECGFE
ENADVVGAIN VSRAGHARFA CEVSGVVRPP AAGTRRGESA RVG