Gene Franean1_1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1744 
Symbol 
ID5670146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2083956 
End bp2085491 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content75% 
IMG OID641240662 
Productargininosuccinate lyase 
Protein accessionYP_001506088 
Protein GI158313580 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00814038 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATCCCG AACGGGCACA GGGCGACGTC GCGCGCACGG ACGCGGCGCG CACGGACACC 
GCCGACGACA CCGACGGAGA GGACACCGGG GGGCACGCCG GCGCGGGTGG GCAGGCGGAC
GCCCCCGCGT CGACGCCGCC GCTGCGGCTG TGGGGCGGGC GGTTCGCGGG CGGGCCGGCC
GAGGCGCTGG CGCGGCTGTC GGTCAGCGTG CAGTTCGACT GGCGCCTGGC GCCGTACGAC
CTGCTCGCCT CCCGCTCACA CGCCCGCGTC CTGCACCGCG CCGGCCTGCT GGACGACGCC
GAGCTGACGG CGATGCTCGG CGCACTCGAC GACCTGTCGG ACGCGGTCGC CCACGGCCGC
TTCCGCCCGA CCATCGAGGA CGAGGACGTC CACACCGCGC TCGAGCGGGG GCTGCTCGAG
CGCCTCGGCG CGCTCGGGGG CAAGCTCCGG GCCGGCCGGA GCCGCAACGA CCAGGTCGCG
ACCGACCTGC GCCTGTACCT GCGCGATCAC GCGCGTCAGG TCGCCGCGCG GGTCACCGAG
CTCTCCACCG CCCTCGTCGG GCTGGCCGAG CAGCACGTCG AGACCCCGGC ACCGGGGATG
ACCCACCTGC AGCACGCCCA GCCGATCTCG TTCGGGCACC AGCTGCTGGC GCACGTCCAC
GCGTTCGCCC GGGACACCGA CCGGCTGCGG GACTGGGACC GGCGTGCCTC GGTGAGCGCG
CTCGGCGCCG GCGCGCTCGC CGGCTCCTCA CTGCCGCTGG ACCCGGCGGG GGTGGCCGCC
GAGCTCGGCT TCGACCGGGC CTTCGCCAAC TCGCTCGACG CGGTGTCCGA CCGGGACTTC
GCCGCCGAGT TCCTCTTCAT CGCCGCGCTG ATCGGGGTGC ACCTGTCCCG GCTCGGTGAG
GAGATCGTCC TGTGGACGAC CCGGGAGTTC GGCTGGGTCG AGCTTGACGA CGCCTTCGCC
ACCGGCAGCT CGATCATGCC GCAGAAGAAG AATCCGGACG TGGCCGAGCT GGCCCGCGGC
AAGTCGGGCC GGCTCATCGG CGCGCTCACC GGGCTGCTGA CCACCCTCAA GGGCCTGCCG
CTCGCCTACG ACCGCGACCT GCAGGAGGAC AAGGAGCCGG TGTTCGACGC GGTCGACACC
CTCCTCGTTG TGCTGCCGGC GGTGACCGGC ATGGTCGCGA CGATGCGGGT GCGCCGGGAA
CGGCTCGCGG CCGCCGCGCC GGACGGGTTC GCGCTGGCGA CGGACGTGGC GGAGTACCTC
GTCCGCAACG GGGTCGCCTT CCGGGAGGCA CACGAGGCCG TCGGGCAGCT CGTGGCCTGG
TGTGTGGCCC ACGACGCCGA CATGGACGAG GTCTCCGAGG ACGATCTCGC GGTCATCAGC
CCACTGCTCA CCGCCGACGT CCGATCGGTG CTCTCGGTGC GTGGTGCGCT CGAGGCACGC
TCGGCACCGG GCGGGACGGC TCCCGCGCGC GTCCGGGAGC AGATCGAGGC GCTCGGGCCG
GTGCTCGACC GGGACCGGGC GTGGGCCGGG AGCTGA
 
Protein sequence
MDPERAQGDV ARTDAARTDT ADDTDGEDTG GHAGAGGQAD APASTPPLRL WGGRFAGGPA 
EALARLSVSV QFDWRLAPYD LLASRSHARV LHRAGLLDDA ELTAMLGALD DLSDAVAHGR
FRPTIEDEDV HTALERGLLE RLGALGGKLR AGRSRNDQVA TDLRLYLRDH ARQVAARVTE
LSTALVGLAE QHVETPAPGM THLQHAQPIS FGHQLLAHVH AFARDTDRLR DWDRRASVSA
LGAGALAGSS LPLDPAGVAA ELGFDRAFAN SLDAVSDRDF AAEFLFIAAL IGVHLSRLGE
EIVLWTTREF GWVELDDAFA TGSSIMPQKK NPDVAELARG KSGRLIGALT GLLTTLKGLP
LAYDRDLQED KEPVFDAVDT LLVVLPAVTG MVATMRVRRE RLAAAAPDGF ALATDVAEYL
VRNGVAFREA HEAVGQLVAW CVAHDADMDE VSEDDLAVIS PLLTADVRSV LSVRGALEAR
SAPGGTAPAR VREQIEALGP VLDRDRAWAG S