Gene Avin_16800 details


Gene Information

Locus tag: Avin_16800
Symbol:
ID: 7760614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1667263
End bp: 1668474
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 63%
IMG OID: 643804578
Product: transposase, IS605
Protein accession: YP_002798868
Protein GI: 226943795
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
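
The start and end coordinates, the gene length, and the protein length in this record agree with one another. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The helper names gene_length and protein_length are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1667263, 1668474)
print(length, protein_length(length))
```

For this record the script prints 1212 403, matching the Gene Length and Protein Length fields.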


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTCGAC TTCAAGCCTT CAAATTCGAA GTGATGCCAA CCGGCGAACA GCAGCGCCAG 
ATGCGCCGCT TCGCTGGCTC GTGCCGGTTC GTGTTCAACA AGGCATTGGC GTGGCAAAAG
GAACGCTACG AACAGGGCGA ATCGAGGCTC GGCTATGCCG GACTGTGCAA GCGGCTCACG
GAATGGCGGC ATGATCCGGA GACGGCCTGG CTGGCGGATG CACCGGTTCA TCCGCTGCAA
CAGGCACTCA AGGACCTGGA GCGAGCCTAC GCCAATTTCT TCGCCCAGCG GGCCGACTTC
CCGCGCTTCA AGAAGAAGGG CCGGCGCGAC AGCTTCCGCT ATCCCGACCC GAAGCAGATC
AAGCTCAACC AGGAAAATAG CCGCCTGTTC CTGCCCAAGC TCGGCTGGCT GCGTTATCGC
AACAGCCGGA ACGTGTCCGG CATGGTGAAG AACGTCACCG TCAGCCAGTG TTGCGGCAAG
TGGTTCGTGT CCATCCAGAC CGAGCGCAAG ATGGCGCAAC CCATCCCGAA GGGTGGTGCG
GTCGGCATCG ACATGGGGGT GTCCCGCTTC GCCACGCTCT CGGACGGCAC GTTCTACGCT
CCGCTCAACA GCTTCAAGCG GCACGAGGAC AGGCTGCGCA AGGCGCAGCG GGCGATGAGC
CGCAAAACCC GACTCAGCAA CAACTGGAAG AAGGCGAAAG CCCGCATCCA GCGTATCCAT
TCCCGGATCG GCAACGCCCG CCGTGACTAC CTGCACAAGA TCTCGACCAC GATCAGCCAA
AACCACGCGA TGGTGTGTAT CGAGGACTTG CCGGTGCGGA ACCTGTCCAG GTCGGCGGCA
GGCACAACCG AAGTACCGGG CAGAAACGTT CGGGCCAAGT CCGGCCTGAA CAAAGCCATC
CTCGACCAGG GCTGGTTCGA GTTCCGCCGC CAACTGGACT ACAAGCTGGC GTGGAACGGC
GGCTGGCTCG TTGCCGTGCC GCCACGGAAC ACCAGCCGCA CCTGCCCGTG CTGCGGGCAT
GTGTCGGCGG ACAACCGGCA GAGCCAGGCC CGGTTCGAGT GCGTGGAGTG TGGTTTCGAG
GAAAACGCCG ATGTGGTCGG CGCGATCAAT GTGTCAAGGG CGGGACACGC CCGGTTCGCC
TGTGAAGTGA GCGGTGTGGT AAGGCCGCCA GCAGCAGGAA CCCGCCGAGG TGAGTCGGCC
CGGGTGGGCT GA
 
Protein sequence
MLRLQAFKFE VMPTGEQQRQ MRRFAGSCRF VFNKALAWQK ERYEQGESRL GYAGLCKRLT 
EWRHDPETAW LADAPVHPLQ QALKDLERAY ANFFAQRADF PRFKKKGRRD SFRYPDPKQI
KLNQENSRLF LPKLGWLRYR NSRNVSGMVK NVTVSQCCGK WFVSIQTERK MAQPIPKGGA
VGIDMGVSRF ATLSDGTFYA PLNSFKRHED RLRKAQRAMS RKTRLSNNWK KAKARIQRIH
SRIGNARRDY LHKISTTISQ NHAMVCIEDL PVRNLSRSAA GTTEVPGRNV RAKSGLNKAI
LDQGWFEFRR QLDYKLAWNG GWLVAVPPRN TSRTCPCCGH VSADNRQSQA RFECVECGFE
ENADVVGAIN VSRAGHARFA CEVSGVVRPP AAGTRRGESA RVG
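
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one possible way to do that in plain Python, not the method used by the database. It assumes the sequence is pasted as a string with the 10-base grouping spaces left in, and it uses the amino-acid assignments of translation table 11, which are the same as the standard code. Table 11 also permits alternative start codons, which this sketch ignores because the gene starts with ATG. The function names clean, gc_fraction, and translate are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 assigns the same amino acids
# as the standard code and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = (
    "ATGCTTCGAC TTCAAGCCTT CAAATTCGAA "
    "GTGATGCCAA CCGGCGAACA GCAGCGCCAG"
)
print(translate(first_line))  # MLRLQAFKFEVMPTGEQQRQ
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to translate and gc_fraction would allow the same comparison against the complete protein sequence and the GC content field.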