Gene Avin_43140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_43140 
Symbol 
ID7764102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4356428 
End bp4358236 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content60% 
IMG OID643807169 
Productphage integrase-like protein 
Protein accessionYP_002801410 
Protein GI226946337 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCCT CCCCCTCCTA CCTGACCAAG AACCGCCACG GAACCTTCTA CTTCCGAATG 
GTCATTCCGG CCCCGTTGCG CACCCTGATC AACAGCAAGC GCGAAGTCAG ACGCAGCCTC
AAGACCGACA GCCGGCGACT TGCCATCAAA CGAGCCCGCC AGTTCGCGGT CAGATACGAA
ACGGCATTCG ACAAGGCGAT CAGCAGCATG ACGACTACCA GGGACGGCGA CGACGTTCTA
ACCGAGGAAG ACATCAAGCT CTTGGAAGAG CTGGACCTAC CGGCAGCCGG CGCATGGTCG
GATCAACCGA GCAACACCCC GCCGGAGCCG ATCCTTACCG ACGAACAGAT TGAGGCCCGA
CAACGGCGCC GGGAAGTTGA ACGCCTTCTT GCTGGCGCCT ACGGCCGCGC CATTCCTACC
GATCAGGAGC CGCTTGCCTC CCGGCTTCTG GAGCTTTCCA AGCCCTACCA GCCCACAGAG
CTGCGGCAGA TACTCCCCAG GCTCCGGGAC GAGCTGATCA AGAGCGCCAT TGCCCCTGCA
CCGGCGCCAG CACCGGCTCC GACCTTCGAT CCAGCCATGG CAGACTGGAC CCTGTACCAA
GTTTGGCAAC ATCAGCTCGA ACGCGACCGG GCCGACATCG CAGCGACCGG GGGCCAGGCA
CGGCACGGGG GCACCCTTGA AGAACGCGAG CGACGCGCCA GGGTTATGAC TGTGCTCACC
CAGCACAAGC CTGTATGCCA GCTCTCAAAG CGCGACTGGC AGGCCGCTTA TGACGCAGCC
CGCCGCATGA AAGCCGGAGT CACGGTATCC GTTGCCCCAG ACCCTCAAAC TCCGCTCGCC
GAGCTTCTAA CGGACGATCC GGCGCTCATG ACCGGGCATG AACGGACAAC CGCCGTCATC
GCCTCGATAA AGCAGCTCCA GACCTATGCG CGCTTCTTGG AGCTGACCAC CATCAGCCCG
GATGATCTAG ACATTCCACC GATCCAAGAG CGCACGACCG CCGGCAGCCG CTCATCCAAG
GCAATTTTCA CGCCATCTGA CCTGGAAAAG ATATTTTCGG GCTGGATCTA CCAGGGCGAC
ATCCCCAGGC GAACCAAGGC ATATCCGTTC TGGTACTGGT TGCCACTGGT CGCCTATTTC
ACCGGCGCAC GCACCGGCGA AATCACCCAG CTCGACACGG CCGACATCCG GGCTATCAAC
GGCCACCCGT GCTTCGATTT TTGTGAGGAC GACCCGAAAG CCTTCGAGGC CAAACGGATC
AAGACAGGGG AAGCCCGCCA AGTTCCGATT CATCCTTGCC TAATCGAACT TGGCTTTCTC
GACTATGTGG CCAGCCAAGC CCAGGACAGG CAGAAGAAAC TATTCGGCGA CGGGCTGACC
TACATGGAGC CCCGGCACGA CACGGACGCC AACAAAGAGG GCTGGACAAA GCGCGCCGGG
AAGTTCTTCA ACGAAGCGCC AGACGGCTAT CTGGTTACAA CTGGCGTCCA CCAGCCAAGG
GACGGCAAGT CGATTTATTC ATTCCGGCAC ACACTGGTAA CAACTCTCAG GAACGCCGAG
CGCGGCGGTC AGGAGCTGAA ACAGACCCTT ATCAACGCCA TTACCGGACA CAGGGAAAAA
GACGTGCAAG GCCGTCATTA CGACAACGGC CCAACTATCG AACTCAAACT TGACGCGCTC
TTGTTGATGC CGGTCCCGGA AGCTATCCAG CGGCTCAAGG GCTACAAACC TGACTTCGTG
GACCGCTTCG GCGACACTCT GACCAAGAGT ATCGCTAGCC ACCGTCGCAA GTACCCACGC
ACGATATGA
 
Protein sequence
MKPSPSYLTK NRHGTFYFRM VIPAPLRTLI NSKREVRRSL KTDSRRLAIK RARQFAVRYE 
TAFDKAISSM TTTRDGDDVL TEEDIKLLEE LDLPAAGAWS DQPSNTPPEP ILTDEQIEAR
QRRREVERLL AGAYGRAIPT DQEPLASRLL ELSKPYQPTE LRQILPRLRD ELIKSAIAPA
PAPAPAPTFD PAMADWTLYQ VWQHQLERDR ADIAATGGQA RHGGTLEERE RRARVMTVLT
QHKPVCQLSK RDWQAAYDAA RRMKAGVTVS VAPDPQTPLA ELLTDDPALM TGHERTTAVI
ASIKQLQTYA RFLELTTISP DDLDIPPIQE RTTAGSRSSK AIFTPSDLEK IFSGWIYQGD
IPRRTKAYPF WYWLPLVAYF TGARTGEITQ LDTADIRAIN GHPCFDFCED DPKAFEAKRI
KTGEARQVPI HPCLIELGFL DYVASQAQDR QKKLFGDGLT YMEPRHDTDA NKEGWTKRAG
KFFNEAPDGY LVTTGVHQPR DGKSIYSFRH TLVTTLRNAE RGGQELKQTL INAITGHREK
DVQGRHYDNG PTIELKLDAL LLMPVPEAIQ RLKGYKPDFV DRFGDTLTKS IASHRRKYPR
TI