Gene Franean1_6473 details

Gene Information

Locus tag: Franean1_6473
Symbol:
ID: 5674788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7868252
End bp: 7869970
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 74%
IMG OID: 641245321
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001510716
Protein GI: 158318208
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
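
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon.

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_bp // 3 - 1


length = cds_length_bp(7868252, 7869970)
print(length)                     # 1719
print(protein_length_aa(length))  # 572
```

Both values match the Gene Length and Protein Length fields in the record.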


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0459133
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCGT CCACGCCGAC GGCGGTGCCC TCGGGCACGA GCCTCGGCCG GCTGGCCGAG 
CTGTCCATCG AGCGGACCGG CGGCACCGGT CCGCTGATCT TCGAGGAGCG GCGCTGGACG
GCCGCGCAGC TCGCGGCCCG GGCCCGGCGC CTGGCCGGCG GGCTGCGCGC GGCCGGGCTC
GTGCCGGGGG ACAGGGTCGC GGTCTGCATG GCGAACTGTC CCGAGGTCGG CATCACCTAC
CAGGCGGCCT GGTGGGCCGG CGCGGCCGTC ACGCCGGTGC TGTTCCTGCT CGGGGAGACG
GACCTGCGTC ACGTGCTGGC CGACAGCGCC GCCTCGTTCG TCGTGACCAC CCCGGACTTC
CTGGACAAGG TCCGCGCGGC GGCCCGGGGC CTGCCCGCGC TGCGTGCCGT CGTGCTGGCC
GAGCAGGCCG AGCCCGCGCC CGCCGACCGT GCCGGGCCAC CTGTGCTGTT GTTCGCCGAG
CTGGAGTCCG CGGCCGAGTC CGACCTGGTG GACGTCGACC CGTCCGGTAT GGCCGCGCTG
CTCTACACCG GGGGGACGAC GGGGCGCGCC CGGGGCGTGG TGCTCTCCCA TGACAACGTC
TCGGCGGCCG CGTGGGCGGT GCACTCGATG CGGTTGGGCG AGGGCCTGCC CGGCCTGCTG
CCCCTGCCGA TGTCGCATGT CTACGGGATG ACCGTGAGCG TCATGGCCAC CTACGCCGAG
ACGCCGGCGA CGGCTGTCCT GATGCGCTGG TTCGAGCCCG TCCGGTTCCT GGAGCTGGTG
GTGGAGCACC AGGTGGCGCA GACGGCGATC GTCCCCGCCA TGGCCCGGAT GATCCTGGAC
CAGGATCTCG ACGGCTACGA CCTGTCGGCG CTGCGCCAGG TGGTCTCCGG GAGCTCGGCG
CTGCCGCGTG AGGTGGCCGA CGAATGGGCG CGCCGGCTGC CCGGGGTCGA GCTCGTGGAG
GGCTACGGCT GCACCGAGGC GTCGGCGATC GTCACGGTGA TGCCACCGGG GCGGACCCGG
CTGGGCAGCG TCGGCCGCCC GGCACCCGGC GTCGAGCTCC GGATCGAGGC CCTGAACGCC
GCCGACGGCT ACCATGACGG CCCGCCCGGG GAAGCGGGTG GTTCGCCGGT CGGGGAGATC
TGCGTGCGCG GGCCGGGCGT CATGCTGGGC TACTGGCGTG ATCCTGCGGC GACCGCGCAG
GCGGTCCGCG CCGGATGGCT GCACACCGGC GACGTCGGCC GCCTTGACCG GGACGGCTTC
CTGTACCTAG TCGACCGGAT GAAGGATCTG ATCATCCGAG GCGGGTTCAA CATCTACCCG
CGGGACGTCG AGGACGCGCT GCGGGAGCAC CCGGACATCG CCGAGGTGGC CGTGATGGGC
CGCCCCGACC GCCGGCTCGG CGAGGAGATC GTCGCGTTCG TCCAGCTCGG CCTGGGCACG
GACGTCTCGG CGGAGGCACT TGTCCGGTTC GGGCGGGAGC GGCTCGGGCC GCTGCGGTAC
CCGCGTGAGG TGCGGATCGT CCTGGCGATC CCGCTCACCA GCATGCTCAA GACAGACCGG
GCGGCCCTGC GGGCGATGCT CACGCCGTCA GTTCCGCCGG CTTCGCTGGC GCCGTCGGCG
CACGGCTCGT CACCCTCGTC GCCCCGGTCG CCAGGGAATT CCGACTCCGT CGCGCGGCAC
GGCGGGCCTG TTGGACCGGT GCCGCGAGTT TCGTCGTAG
 
Protein sequence
MIASTPTAVP SGTSLGRLAE LSIERTGGTG PLIFEERRWT AAQLAARARR LAGGLRAAGL 
VPGDRVAVCM ANCPEVGITY QAAWWAGAAV TPVLFLLGET DLRHVLADSA ASFVVTTPDF
LDKVRAAARG LPALRAVVLA EQAEPAPADR AGPPVLLFAE LESAAESDLV DVDPSGMAAL
LYTGGTTGRA RGVVLSHDNV SAAAWAVHSM RLGEGLPGLL PLPMSHVYGM TVSVMATYAE
TPATAVLMRW FEPVRFLELV VEHQVAQTAI VPAMARMILD QDLDGYDLSA LRQVVSGSSA
LPREVADEWA RRLPGVELVE GYGCTEASAI VTVMPPGRTR LGSVGRPAPG VELRIEALNA
ADGYHDGPPG EAGGSPVGEI CVRGPGVMLG YWRDPAATAQ AVRAGWLHTG DVGRLDRDGF
LYLVDRMKDL IIRGGFNIYP RDVEDALREH PDIAEVAVMG RPDRRLGEEI VAFVQLGLGT
DVSAEALVRF GRERLGPLRY PREVRIVLAI PLTSMLKTDR AALRAMLTPS VPPASLAPSA
HGSSPSSPRS PGNSDSVARH GGPVGPVPRV SS
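
As a rough cross-check between the two sequences above, the gene sequence can be translated and its GC fraction computed with a few lines of plain Python. This is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code only in which codons may serve as starts).

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
fragment = clean("ATGATCGCGT CCACGCCGAC GGCGGTGCCC")
print(translate(fragment))  # MIASTPTAVP
print(round(gc_fraction(fragment), 2))
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.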