Gene Franean1_5671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5671 
Symbol 
ID5673998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6884321 
End bp6885430 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content72% 
IMG OID641244525 
Productputative agmatinase 
Protein accessionYP_001509928 
Protein GI158317420 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase
[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCATGG CACAGGCCGG CGATCGGGAC GTGGCACGCG AGCGGGGGAG CGCCGGTCCG 
GTGCCGCCGT ACGCGGCGCG GGCCGCGGGA GCGGACGATC CCGGCGCGGA CGCCCCGGGA
TACTCCGGCC TGGCGACCTT CGCCGGTCTG CCCTGGATGC CGGGACTCGG CGATCTTCGG
GCGCGCCGGC CCGATGTCGC TGTGGTCGGG GCTCCGTTCG ACATCGCGAC CACGCACCGG
CCGGGCGCGC GTTTCGGCCC GCGGGCGCTG CGCGCTCAGG CGTACAACCC TGGCACATAT
CATCTTGATC TTGGTATAGA GATCTTTGAC TGGCTGGACG TCGTTGACGC GGGGGACGCG
CACTGCCCGC ACGGGCTGAC CGAGGTCTCA CATCGCAACA TCCGGGCGAA GGTCGGCGAC
GTCGCGCGGC TCGGCGTCAT CCCAGTGATC ATCGGGGGGG ACCACTCGAT CACCTGGCCG
GCGGCTAGCG GGGTCGCCGA GGCGGTGGGC TGGGGTGAGG TCGGCCTGCT GCACTTCGAC
GCCCACGCCG ACACGGCCGA CATCATCGAC GGGAACCTGG CCTCGCACGG GACGCCTATG
CGGCGGCTCA TCGAGTCGGG GGCGGTGCGC GGGCGCAACT TCGTCCAGGT GGGGCTGCGC
GGGTACTGGC CTCCGCCGGA CGTGTTCGCG TGGATGCGCG AGCAGGGTAT GCGCTGGCAC
CTGATGCACG AGATCTGGGA GCGGGGGAGC CGAGAGGTGG TCGCCGAGGC GATCGCGCAG
GCGGTGGACG GCTGCCGCGC GCTCTACCTG TCGGTCGACA TCGACGTCCT CGACCCGGGG
TTCGCGCCTG GGACGGGCAC CCCCGAGCCG GGCGGCATGA ACCCGGCCGA CCTGTTGCGG
GCCGTGCGGC AGATCGCGCT GGACACGCCG ATCGTCGCCG CGGACATCGT CGAGGTCTCG
CCTCCGTACG ACCACGCGGA GACGACGGTG AACAGCGCGC ACCGGGTCGC GATGGAGATT
TTCGCGGCGT TGGCGCATCG CCGCCGTAGC GCGGCCGGTG GGACGGCGGA CCTTCCCGCG
GGGCTCCCGA AGGCGAAAGC TGGGTCTTGA
 
Protein sequence
MCMAQAGDRD VARERGSAGP VPPYAARAAG ADDPGADAPG YSGLATFAGL PWMPGLGDLR 
ARRPDVAVVG APFDIATTHR PGARFGPRAL RAQAYNPGTY HLDLGIEIFD WLDVVDAGDA
HCPHGLTEVS HRNIRAKVGD VARLGVIPVI IGGDHSITWP AASGVAEAVG WGEVGLLHFD
AHADTADIID GNLASHGTPM RRLIESGAVR GRNFVQVGLR GYWPPPDVFA WMREQGMRWH
LMHEIWERGS REVVAEAIAQ AVDGCRALYL SVDIDVLDPG FAPGTGTPEP GGMNPADLLR
AVRQIALDTP IVAADIVEVS PPYDHAETTV NSAHRVAMEI FAALAHRRRS AAGGTADLPA
GLPKAKAGS