Gene Franean1_2127 details


Gene Information

Locus tag: Franean1_2127
Symbol:
ID: 5670527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2553997
End bp: 2555337
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 75%
IMG OID: 641241048
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_001506469
Protein GI: 158313961
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
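
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. Both assumptions are consistent with the values listed, but the record does not state them.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2553997
END_BP = 2555337
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The same two checks can be reused on other CDS records that report start, end, gene length, and protein length.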


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCCCAA AGTATCGCCG TTGCGATCGA CGTAGGTACG TGAATCGCGC CGATACGCTC 
GGCTATGTGG TCAGTGAGTG GTATGACCCT GCCGACCTGG CGCGACGGGT CACAGAGCAG
GACAAGGCTA CACCGGGCGA ACGCACCCCG TTCGAGCGGG ATCGTGCCCG GGTGCTCCAT
TCGAGCGCCT TGCGGCGACT TGCCGGAAAA ACCCAGGTCG TGGGCCCGCT CGACGACGAC
TTCCCGCGCA CCCGGCTGAC CCACTCGCTC GAGACGGCGC AGATCGGGCG CGGTCTGGCC
CGCTCGCTGG GCGCCGATCC CGACCTGGTG GACGCCGCCT GCCTCGCCCA CGACATCGGG
CATCCCCCGT TCGGTCACAA CGGAGAGGTG GCGCTCGACC AGGCGGCGCA CACCTGCGGC
GGGTTCGAGG GCAACGCCCA GAGCCTGCGC GAGCTGACCC GGCTCGAGGT GAAGATCGTC
GCGCCCGCCG GGGAGACCGG CGGCGGCGCG GCCGGCGGCG GCGCGGGGCT GAACCTGACC
CGGGCGACCC TGGACGCCGC CGTGAAGTAC CCGTGGCTGC GCCGCGCCGG CACGCCGAAG
TTCGGGGCCT ACGCCGACGA CGCCGGCATC CTCTCCTGGG TGCGCCGCGA CGCCCCCGGC
GCGCGGCGCA GCTTCGAGGC CCAGCTCATG GACTGGGCGG ACGACGTGGC CTACTCCGTG
CACGACCTGG AGGACGGCGT CGTCGCCGGG CACATCGACC TCGCCGCGCT GCGTGACCCC
GAGCTGCGTG CCGAGCTGGC CGCCCGGACG GCGGCCTGGT ACCCGGACGT CGACGCGCCC
GCCGCGGCGG CCGGGCTCGA CCGGCTGCGC GCCCAGCCGT GGTGGATCCG GGAGGAGGTC
GGGTCCGTCG CCGGCCTGGC CGCGCTGCGC GCGATGACCA GCGAACTCGT CGCCCGGTTC
TCGATGGCCG CGGTGCGGGC CACCCGCGAG CGGCACGGAG ATGAGCCGCT GCGCCGCTAC
CGGGCCGACC TGGTCGTGCC GGTGGAGACG CTCGCCGAGT GCGCGGCGCT CAAGGGCGTC
ACCGCGTGGT ACGTGATGGG CCGGCCCGGC GCCGCCGAGC GGCGGGCCCG GCAGCGCGAG
CTGATCGCCG AGCTGGTCGA CCTGCTCGCG GCCAGCGCCC CGGCGTCGTT GGACGCGCCG
CTGGCCGACT CCTACCGGCA CGCGGCCGAC GACGCGGCCC GGCTGCGGGT GGTCATCGAC
CAGGTCGCCC GGCTCACCGA CGCCAGCGCC GGGCGCCGGC ACGCGGCACT GACGGGCCGC
CCGCTGCCGA GGGCGCTGTG A
 
Protein sequence
MRPKYRRCDR RRYVNRADTL GYVVSEWYDP ADLARRVTEQ DKATPGERTP FERDRARVLH 
SSALRRLAGK TQVVGPLDDD FPRTRLTHSL ETAQIGRGLA RSLGADPDLV DAACLAHDIG
HPPFGHNGEV ALDQAAHTCG GFEGNAQSLR ELTRLEVKIV APAGETGGGA AGGGAGLNLT
RATLDAAVKY PWLRRAGTPK FGAYADDAGI LSWVRRDAPG ARRSFEAQLM DWADDVAYSV
HDLEDGVVAG HIDLAALRDP ELRAELAART AAWYPDVDAP AAAAGLDRLR AQPWWIREEV
GSVAGLAALR AMTSELVARF SMAAVRATRE RHGDEPLRRY RADLVVPVET LAECAALKGV
TAWYVMGRPG AAERRARQRE LIAELVDLLA ASAPASLDAP LADSYRHAAD DAARLRVVID
QVARLTDASA GRRHAALTGR PLPRAL
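
The protein sequence relates to the gene sequence through translation table 11, and the GC content is a property of the gene sequence itself. The following is a minimal sketch of both calculations, not the method used to generate this record. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical. The codon table is the standard one, and the start-codon set is the one NCBI lists for table 11 as far as I am aware; a first codon from that set is read as methionine, which is why the leading GTG here corresponds to M.

```python
from itertools import product

# Standard codon table built in TCAG order; table 11 assigns the same
# amino acids and differs in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on non-ACGT symbols
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate_cds("GTGCGCCCAA AGTATCGCCG TTGCGATCGA"))
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` gives values that can be compared against the protein sequence and GC content reported in this record.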