Gene Avin_11540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_11540 
Symbol 
ID7760096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1104157 
End bp1105377 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content74% 
IMG OID643804056 
Productexonuclease subunit SbcD 
Protein accessionYP_002798358 
Protein GI226943285 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.341615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCTCC TGCACACCTC CGACTGGCAC CTCGGCCAGC ACTTCATGGG CAAGACCCGC 
CAGGCCGAGC ACCGGGCCTT CTGCGACTGG CTCGTCGAGC GGGTGCGCGA GCACGCGGTC
GACGCGCTGA TCGTCGCCGG CGACCTGTTC GACAGCGGCG CGCCGCCCAG TCATGCCCGC
GAGCAGTACA ACCGCTTCAT CGTCGCGCTG CGCGCGACCG GCGCCCGCCT GGTGGTGCTC
GGTGGCAACC ACGATTCGGT GGCCATGCTC GGCGAGTCGC GCGGTCTTCT GGCCTGCCTG
GACACCTGGG TGATTCCGGG CGTGGCGGCG GACCCGGCCG AGCAGCTCCT GCTGTTGCCG
CGGCGCGACG GTGCGCCGGG CGCGTTGCTC TGCGCCATCC CCTTCATCCG TCCGCGCGAC
GTGCTGAAAA GCGAGGCCGG GCAGAGCGCG GACGCCAAGC TGCAGGCGCT GCAGGCGGCG
ATCCGCGAAC ACTACCGGGC GCTGTTCGCC CTCGCCGAGG CGCGTCGCCG CGAGCTGGGC
GGCGCCCTGC CGATCGTCGC CACGGGGCAC CTGACCACCG TCGGCGCCAG TGCCAGCGAA
TCGGTGCGGG AGATCTACGT CGGCAGCCTG GAGGCTTTCC CGACCGATGC CTTCCCGCCG
GCGGCCTATG TCGCCCTCGG CCATATCCAT CGCCCGCAGC AGGTCGCCGG GCTGGAGCAC
ATCCGCTACA GCGGCTCGCC GATCCCGCTG TCCTTCGACG AGGCGCGCCA GTGCAAGGAG
GTGTTGCTGG TCGACCTGGG CGAGGACGGC CTCGAGGCGG TGACGCCGCT GCCAGTGCCC
TGTTTCCAGC CGCTGCTCAC GCTGCGCGGC GATCTCGCCG AGCTGGCCGG CGCCGTTGTC
GAGGCGGCCG CCGGGGGTAG CGCCGAGCGT CCGGTGTGGC TGGAGGTCCG GGTCGTCGCC
GACGAGCACC TGCCCGACCT GCCGGCGCGC GTCGCCGCCC TTTGCGCGGG GCTGCCGGTG
GAGGTGCTGC GCATCCGCCG CGAGCGCGGC GACGCAGTCG CCCGCCTGTG CCGCGAGGCA
CGGGAAACCC TCGACGAACT GAGCCCCGAG GAGGTGTTCG AACAGCGCCT GGCCGGCGCG
GCACTGGACG AGGCGCTGGC CGGGCGCCTG CGCGGCCTGC ACCGCCAGGT GCTCGACGAG
CTGCGCGAGG AGCGGGCGTG A
 
Protein sequence
MRLLHTSDWH LGQHFMGKTR QAEHRAFCDW LVERVREHAV DALIVAGDLF DSGAPPSHAR 
EQYNRFIVAL RATGARLVVL GGNHDSVAML GESRGLLACL DTWVIPGVAA DPAEQLLLLP
RRDGAPGALL CAIPFIRPRD VLKSEAGQSA DAKLQALQAA IREHYRALFA LAEARRRELG
GALPIVATGH LTTVGASASE SVREIYVGSL EAFPTDAFPP AAYVALGHIH RPQQVAGLEH
IRYSGSPIPL SFDEARQCKE VLLVDLGEDG LEAVTPLPVP CFQPLLTLRG DLAELAGAVV
EAAAGGSAER PVWLEVRVVA DEHLPDLPAR VAALCAGLPV EVLRIRRERG DAVARLCREA
RETLDELSPE EVFEQRLAGA ALDEALAGRL RGLHRQVLDE LREERA